All Content tagged with AWS Glue

AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.

1849 results
I'm trying to reproduce the exercise in the link below, from the AWS Glue Immersion day workshop, but I'm having a problem with a rule recommendation failure in Glue Data Quality, due to an ambiguous ...
1 answer, 0 votes, 24 views, asked 11 days ago
I'm using dbt with the Glue adapter to create our data lake on AWS Glue. I'm not able to query Iceberg and non-Iceberg tables. These tables are in the same Glue Catalog (same database). I'm able to query non-iceb...
1 answer, 0 votes, 26 views, asked 11 days ago
I created a Glue job in the AWS console and wanted to connect it to GitLab (an enterprise account). I generated a Personal Access Token (PAT) for the private GitLab repository, granting it owner permi...
2 answers, 0 votes, 28 views, asked 11 days ago
I have a set of scripts that call a government hierarchy API and the rate limit is 4000/hr. I have a separate script for each function: 1. OAuth token retrieval and store to SSM 2. Get token and call ...
2 answers, 0 votes, 28 views, asked 11 days ago
I need to transfer data from a PostgreSQL database to a MySQL database using AWS Glue, with transformations applied to the data. The data source allows direct connectivity to PostgreSQL, but the targe...
1 answer, 0 votes, 21 views, asked 12 days ago
Hello, I have created the Glue job below in CDK, which I have scheduled and which is running fine. But now I want to create another Glue job, "pos table replication large", and want to schedule the job at...
1 answer, 0 votes, 25 views, asked 13 days ago
Hello, I don't want to update the catalog when a column is added or deleted in the JDBC source data table. When the crawler detects schema changes in the data store, it should ignore the c...
1 answer, 0 votes, 32 views, asked 13 days ago
I was trying to build a data pipeline with no-code ETL using AWS Glue Studio, and I kept getting the error message "Node is Misconfigured" when I added the join node using Visual ETL (see attached). Un...
1 answer, 0 votes, 17 views, asked 13 days ago
Post-deployment of the CUDOS Dashboards, if you want to migrate to CUR 2.0, you have to deploy the Data Exports stack as found below https://catalog.workshops.aws/awscid/en-US/dashboards/foundational...
2 answers, 0 votes, 59 views, asked 14 days ago
There are multiple AWS Glue connections (also called Data Catalog connections or data connections) created using AWS Glue Studio for different data sources. How can I generate the equivalent code fo...
2 answers, 0 votes, 29 views, asked 16 days ago
I have an S3 dataset that is cataloged in Glue. A new column was added to the S3 data, and the crawler was re-run, which updated the Glue table definition to add the new column. This new column is vis...
1 answer, 0 votes, 28 views, asked 17 days ago
I am using an Athena federated query with a Glue table to query IoT data stored in DocumentDB. The time format reported by the IoT sensors is ISO 8601 (here is the entry from DocumentDB: "time" : "2024-11-20T0...
1 answer, 0 votes, 36 views, asked 18 days ago