Help improve AWS Support Official channel in re:Post and share your experience - complete a quick three-question survey to earn a re:Post badge!
All Content tagged with AWS Glue
AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.
Content language: English
Select tags to filter
Sort by most recent
2010 results
I created a connection in **AWS Glue Data Connections**, but it **failed**. After that, I **deleted the connection**, and the deletion was **successful**.
However, the **UI did not refresh** with t...
Hi AWS Experts,
I am working on an AWS Glue job to transfer data from an on-premise SQL Server to AWS RDS (SQL Server). My goal is to achieve a 20-second execution time, but despite multiple optimiza...
I'm trying to do some partition evolutions on my iceberg tables. It seems like `ALTER TABLE ... DROP PARTITION FIELD ...` is not supported by Athena and/or spark sql queries.
Is that correct, or am I ...
Hi,
I'm having a strange issue. I have two S3 buckets or DataCatalog tables to join.
When I query them on Athena they sure gives me results when I join them.
 instance. I have created and run a Crawler for a table to create a corresponding table in my Glue Data Catalog. I have been unable to read from ...

I get error below when I try to use a custom regex pattern to detect sensitive data in Glue Studio
Error Category: UNCLASS...
EXPERT
published 2 months ago0 votes181 views
This spotlight on AWS Glue equips you with the skills and troubleshooting tips to get the most out of a data integration service.
Can I enable optimization in AWS Glue when I create an iceberg table? Is there an option like a tableProperty?
df.writeTo(f"{table_name}") \
.tableProperty("format-version", "...
I want to run the two database table in parallel in AWS Glue job. Currently the database table run sequentially one after another and it also increase running time of AWS Glue job
Hi,
I use Athena to query tables from glue data catalog stored as JSON and with many partitions. However, the number of records is not correct when I select * from tables even though the data scanned...
I am working on a process that is as follows:
1. New S3 file is PUT in a bucket/folder (i.e. bucket_name/folder_name/file_name.json)
2. Lambda 1 is triggered by S3 PutObject and sends a message contai...