By using AWS re:Post, you agree to the AWS re:Post Terms of Use

All Content tagged with AWS Glue

AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.

Content language: English

Select up to 5 tags to filter
Sort by most recent
1848 results
Good afternoon, I'm currently working within S3, Athena, and Glue, and I've run into a problem when uploading new data into S3. Whenever I upload new data into S3 manually, there's a mismatch in how ...
3
answers
0
votes
92
views
asked 2 months ago
Hi I have a glue job requirement which is processing CSV file from s3 and insert each record on MySQL RDS table. Traffic requirement - 1 file per day Currently we have fixed dpu setup, just want to k...
2
answers
0
votes
38
views
asked 2 months ago
I'm trying to connect to snowflake from AWS Glue. I've created a snowflake connection as stated in link below. [https://docs.aws.amazon.com/glue/latest/dg/aws-glue-programming-etl-connect-snowflake-ho...
2
answers
0
votes
118
views
asked 2 months ago
Hi, having troubles querying Iceberg history table using PySpark from AWS Glue 4.0, *same queries working in the Athena web query editor*. **The query that failing:** ``` def get_last_snapshot_id_by_...
1
answers
0
votes
96
views
asked 2 months ago
I have a redshift external schema named example_ext_schema pointing to Glue data catalog. I would like to add an additional table to the "example_ext_schema", can I add it ?
1
answers
0
votes
46
views
asked 2 months ago
I am encountering an IllegalArgumentException while running a PII detection script in AWS Glue. The error message indicates that an invalid column name is being used in sourceColumns. ``` S3bucket_n...
2
answers
0
votes
49
views
asked 2 months ago
we have a daily glue job to get athena results to s3, the job has been working and today's run we got an error, the same query works in Athena UI Glue error: id: <redacted> Error Category: UNCLASSIF...
3
answers
0
votes
75
views
asked 2 months ago
I want to provide the schema when exporting a mongodb collection in AWS glue. So I've created a crawler which successfully crawled my collection. This proves to me that the crawler can reach mongodb a...
2
answers
0
votes
63
views
asked 2 months ago
1. I am trying to create data catalog table in AWS glue which has different cols and their datatypes (varchar,date,timestamp). and the respective values are stored in CSV format in S3. But while query...
1
answers
0
votes
73
views
asked 2 months ago
I have AWS Glue Crawlers that crawls databases for table's metadata in Snowflake via JDBC connection. It will crawl table's fields and types to AWS Glue but it will not bring the fields **descriptions...
2
answers
0
votes
77
views
asked 2 months ago
1) RDS oracle DB is created in account A and publicly access is enabled 2) Oracle DB can be accessed via SQL Developer using Hostname = *<Endpoint>* of oracle DB in step 1 Port = 1521 SID = xx 3) Unde...
2
answers
0
votes
246
views
asked 2 months ago
Hello, I’m seeking guidance and suggestions on cost-effective methods for scanning a couple of DynamoDB tables, each with sizes of up to 3 TB and 5 TB. Our goal is to join these tables based on cert...
2
answers
0
votes
98
views
asked 2 months ago