Questions tagged with AWS Glue
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
I need to replicate an iceberg datalake stored in S3 from one bucket to another. However, multi-region access point doesn't work with Athena table. And I don't see any pyspark procedure that could...
1
answers
0
votes
360
views
asked 2 months agolg...
When creating a table with a leading space in the name, AWS Glue does not display any error during the creation process. However, upon checking the API logs, I noticed that the space is present in the...
1
answers
0
votes
431
views
asked 2 months agolg...
Hello,
I new at glue and created a job to extract the content on a table in oracle (on premise) to S3, the oracle conection is sucessfull but whe trying to write to S3 it says:
An error occurred...
4
answers
0
votes
114
views
asked 2 months agolg...
I need to fetch files that has arrived current_time - 1hr from my S3 bucket for processing. My files name will be in format yyyymmdd-hhmmsssss.parquet (includes milli seconds also). So I am running a...
1
answers
0
votes
408
views
asked 2 months agolg...
Hello AWS Support and Forum,
basic free tier account for some unknown reason. I've tried different regions and I still get the same 'AccessDeniedException: Account id... is denied access.' error! for...
2
answers
0
votes
127
views
asked 2 months agolg...
In the AWS console, we can see if a crawler run returned with changes in schema for particular tables:
![crawler ui](/media/postImages/original/IMVS_OmxwOTayK7TSm0yA_6g)
If I were to click on where...
1
answers
0
votes
425
views
asked 2 months agolg...
Hi everyone,
I've been running a Glue job smoothly for the past year, processing a collection of individual data blocks from Location A and transferring them to Location B. Each block shares the same...
0
answers
0
votes
91
views
asked 2 months agolg...
Hello
I am writing a glue script to transfer a table from DynamoDB to S3 bucket. I have put the necessary config into the code and enabled bookmark in Job Details and ran the script three times and...
3
answers
0
votes
167
views
asked 2 months agolg...
Metric log error - ThroughputMetricsSource: Metric: is already registered by a different accumulatorlg...
Hi, When running the same job concurrently, I see the below error in the logs, is there a way to resolve this error?
ThroughputMetricsSource: Metric:...
1
answers
0
votes
262
views
asked 2 months agolg...
I am trying to extract data from AWS RDS using AWS Glue. This RDS is using mariaDB engine and is in different account and VPC. When I am testing the Glue connection it is showing **successful**. Also...
3
answers
1
votes
484
views
asked 2 months agolg...
Hello team,
So, I built an ETL in python using pyspark. I have a bastion EC2 mysql database that is a copy of a production environment.
Every day it is copying the prod at round 2 oclock, and my...
1
answers
0
votes
187
views
asked 2 months agolg...
I'm running into an issue when reading a Glue Data Catalog data source in an Visual ETL AWS Glue job. An extra column is being added in called 'col40', which is not in the underlying file that was...
1
answers
0
votes
203
views
asked 2 months agolg...