Questions tagged with Extract Transform & Load Data
Hi All,
I have some issues when running my Glue job. I landed my pipe-delimited CSV file in an S3 bucket, and after running the crawler pointing to the folder where the file is placed, a Glue catalog...
1 answer, 0 votes, 1699 views, asked a year ago
Looking for a little insight on this:
ETL job fails with the following error: "Connection reset on https://xxxxxx.com:9200 https://xxxxxxx.com:9200"
Basic information:
*ETL job is pulling data...
1 answer, 0 votes, 223 views, asked a year ago
Hey Guys!
I am trying to read a large amount of data (about 45 GB in 5,500,000 files) in S3 and rewrite it to a partitioned folder (in another folder inside the same bucket), but I am facing this...
1 answer, 0 votes, 487 views, asked a year ago
I use a crawler to load all my S3 CSV files into the Glue Data Catalog. Now I want to create a Glue job to run ETL (create and drop temporary tables, select and insert data into tables in the Data Catalog). But...
0 answers, 0 votes, 59 views, asked a year ago
I can't find a proper way of setting the correct data type for a timestamp attribute on my Athena table (**parquet**) in order to query for time intervals.
I'm creating the table via a crawler on parquet...
1 answer, 0 votes, 507 views, asked a year ago
I have written a Lambda function that converts a JSON file in the raw S3 bucket into a Parquet file and uploads it directly to the cleansed S3 bucket. I cannot delete the JSON files since I want to convert...
2 answers, 0 votes, 1340 views, asked a year ago
I'm trying to fill an Aurora MySQL DB from a CSV in an S3 bucket using the manual [Loading data into an Amazon Aurora MySQL DB cluster from text files in an Amazon S3...
1 answer, 0 votes, 220 views, asked a year ago
Hello, I'm creating a Glue job using a Jupyter notebook, and I'm currently using Ray as the ETL type.
After running the job once, I noticed I can no longer save my notebook or push it to a repository...
1 answer, 0 votes, 389 views, asked a year ago
I am running a Glue job to fetch records from Microsoft SQL Server, but the Glue job keeps running and does not show any results. The job is scheduled with the G.2X worker type and 5 workers, with auto scheduling.
Logs:-...
2 answers, 0 votes, 1171 views, asked a year ago
I have noticed that my AWS Step Function freezes after a previous run has been aborted with some jobs still left in the pending state.
The use case:
I have a step function that increments through the...
0 answers, 0 votes, 73 views, asked a year ago
Join Tables in AWS
I want to join two tables. I have the tables in CSV format stored in an S3 bucket.
1. Is AWS Glue Studio the right option?
2. What is the correct procedure?
3. What are the IAM permissions...
2 answers, 0 votes, 316 views, asked a year ago
Hi,
I am using Glue ETL version Spark 3.0 with Python version ![Glue Job Details](/media/postImages/original/IMGyPWz2XIS_-4GohAdHXVpw)
The ETL job has only 2 steps. I am using...
1 answer, 0 votes, 236 views, asked a year ago