Questions tagged with Extract Transform & Load Data

glue job - Issues when reading from the glue catalog table using dynamic frame

Hi all, I have some issues when running my Glue job. I landed my pipe-delimited CSV file in an S3 bucket, and after running the crawler pointing to the folder where the file is placed, a glue catalog...

AWS Glue, Extract Transform & Load Data

1699 views

Pradeep asked a year ago

ETL Connection reset error when pulling data from RDS DB

Looking for a little insight on this: ETL job fails with the following error: "Connection reset on https://xxxxxx.com:9200 https://xxxxxxx.com:9200" Basic information: *ETL job is pulling data...

Extract Transform & Load Data

223 views

Jennifer asked a year ago

Exception in User Class: com.amazonaws.SdkClientException : Unable to execute HTTP request: readHandshakeRecord AWS GLUE

Hey guys! I am trying to read a large amount of data (about 45 GB in 5,500,000 files) in S3 and rewrite it into a partitioned folder (in another folder inside the same bucket), but I am facing this...

AWS Glue, Extract Transform & Load Data

487 views

lp_evan asked a year ago

execute multiple sql statements against data catalog tables

I run a crawler to load all my S3 CSV files into the Glue Data Catalog. Now I want to create a Glue job to execute ETL (create and drop temporary tables, select and insert data into tables in the Data Catalog). But...

Amazon Athena, AWS Glue, Extract Transform & Load Data

rePost-User-3320844 asked a year ago

An error has been thrown from the AWS Athena client. SYNTAX_ERROR: line 2:43: '2023-02-07T23:59:59.613000+00:00' is not a valid timestamp literal

I can't find a proper way of setting the correct data type for a timestamp attribute on my Athena table **parquet** in order to query for time intervals. I'm creating the table via a crawler on parquet...

Accepted Answer

Amazon Athena, Analytics, AWS Glue, Extract Transform & Load Data

507 views

Jorge Vidinha asked a year ago

Athena Error- HIVE_BAD_DATA: Not valid Parquet file: s3://deng-utube-raw-us-east-1-dev/youtube/raw_stats_reference_data/FR_category_id.json expected magic number: PAR1 got: ] }

I have written a Lambda function that converts a JSON file in the raw S3 bucket into a Parquet file and uploads it directly to the cleansed S3 bucket. I cannot delete the JSON files since I want to convert...

Accepted Answer

Amazon Simple Storage Service, Amazon Athena, AWS Lambda, Extract Transform & Load Data

1340 views

rePost-User-2028403 asked a year ago

Unknown system variable using SET parameter in LOAD DATA FROM S3 expression

I'm trying to fill an Aurora MySQL DB from a CSV in an S3 bucket using the manual [Loading data into an Amazon Aurora MySQL DB cluster from text files in an Amazon S3...

Extract Transform & Load Data, Aurora MySQL

220 views

Andrew asked a year ago

Hi, I'm encountering an unfamiliar behavior with Glue Notebooks wherein the Glue version downgrades from Glue Version 4.0 to Glue Version 3.0.

Hello, I'm creating a Glue Job using Jupyter notebook and I'm currently using Ray as the ETL type. After running the job once, I noticed I can no longer save my notebook or push it to a repository...

AWS Glue, Extract Transform & Load Data

389 views

rePost-User-4831034 asked a year ago

Glue job keeps running but does not write results

Running a Glue job to fetch records from Microsoft SQL Server, but the Glue job keeps running and does not show any results. The job is scheduled with the G.2X worker type and 5 workers, with auto scheduling. Logs:-...

Analytics, High Performance Compute, AWS Glue, Extract Transform & Load Data

1171 views

Himanshu asked a year ago

Step Function freezes

I have noticed that my AWS Step Function freezes after a previous run has been aborted with some jobs still left in the pending state. The use case: I have a step function that increments through the...

AWS Step Functions, Extract Transform & Load Data

Denys asked a year ago

Join Tables in AWS

I want to join two tables. I have the tables in CSV format stored in an S3 bucket. 1. Is AWS Glue Studio the right option? 2. What is the correct procedure? 3. What are the IAM permissions...

Analytics, AWS Glue, Extract Transform & Load Data

316 views

rePost-User-7645024 asked a year ago

Glue ETL converts NULL values of Snowflake columns to 0

Hi, I am using Glue ETL version Spark 3.0 with Python version ![Glue Job Details](/media/postImages/original/IMGyPWz2XIS_-4GohAdHXVpw) The ETL job has only 2 steps. I am using...

AWS Glue, Extract Transform & Load Data

236 views

Kyle Ahn asked a year ago