Questions tagged with AWS Glue
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
I need to fetch files that has arrived current_time - 1hr from my S3 bucket for processing. My files name will be in format yyyymmdd-hhmmsssss.parquet (includes milli seconds also). So I am running a...
1
answers
0
votes
420
views
asked 2 months agolg...
Hello AWS Support and Forum,
basic free tier account for some unknown reason. I've tried different regions and I still get the same 'AccessDeniedException: Account id... is denied access.' error! for...
2
answers
0
votes
139
views
asked 2 months agolg...
In the AWS console, we can see if a crawler run returned with changes in schema for particular tables:
![crawler ui](/media/postImages/original/IMVS_OmxwOTayK7TSm0yA_6g)
If I were to click on where...
1
answers
0
votes
439
views
asked 2 months agolg...
Hi everyone,
I've been running a Glue job smoothly for the past year, processing a collection of individual data blocks from Location A and transferring them to Location B. Each block shares the same...
0
answers
0
votes
94
views
asked 2 months agolg...
Hello
I am writing a glue script to transfer a table from DynamoDB to S3 bucket. I have put the necessary config into the code and enabled bookmark in Job Details and ran the script three times and...
3
answers
0
votes
183
views
asked 3 months agolg...
Metric log error - ThroughputMetricsSource: Metric: is already registered by a different accumulatorlg...
Hi, When running the same job concurrently, I see the below error in the logs, is there a way to resolve this error?
ThroughputMetricsSource: Metric:...
1
answers
0
votes
306
views
asked 3 months agolg...
I am trying to extract data from AWS RDS using AWS Glue. This RDS is using mariaDB engine and is in different account and VPC. When I am testing the Glue connection it is showing **successful**. Also...
3
answers
1
votes
495
views
asked 3 months agolg...
Hello team,
So, I built an ETL in python using pyspark. I have a bastion EC2 mysql database that is a copy of a production environment.
Every day it is copying the prod at round 2 oclock, and my...
1
answers
0
votes
208
views
asked 3 months agolg...
I'm running into an issue when reading a Glue Data Catalog data source in an Visual ETL AWS Glue job. An extra column is being added in called 'col40', which is not in the underlying file that was...
1
answers
0
votes
215
views
asked 3 months agolg...
hello, I am creating a dataframe consuming from a Glue Catalog table, this table has fields of type bigint, which can be null. It turns out that when this information is null, the dataframe ignores...
Accepted AnswerAWS Glue
1
answers
0
votes
151
views
asked 3 months agolg...
I was running glue job to process data from MariaDB inside VPC. Recently my glue job get "com.mysql.cj.jdbc.exceptions.CommunicationsException: Communications link failure" although it was running...
1
answers
2
votes
233
views
asked 3 months agolg...
Hello! According to the [documentation](https://docs.aws.amazon.com/glue/latest/dg/aws-glue-programming-etl-connect-kinesis-home.html), it should be possible to write data to Kinesis from Glue...
2
answers
0
votes
1231
views
asked 3 months agolg...