Questions tagged with Extract Transform & Load Data

Content language: English

Select up to 5 tags to filter

Sort by most recent

Filter Questions by

AllAnsweredUnansweredNo Answer

Browse through the questions and answers listed below or filter and sort to narrow down your results.

AWS Glue - Surprising costs for data preview session

Hello, While building a job in AWS Glue (Amazon S3, Change Schema, AWS Glue Data Catalog), I had a surprising cost for data preview session (AWS Glue GlueInteractiveSession) of 91% of the total...

Accepted AnswerAWS Glue Extract Transform & Load Data

answers

votes

173

views

rePost-User-4717319

asked 3 months ago

ORA-39001: invalid argument value while importing data dump file in aws rds for oracle.

I am importing the data dump file that I have downloaded from S3. ``` -----load schema DECLARE v_hdnl NUMBER; BEGIN v_hdnl := DBMS_DATAPUMP.OPEN(operation => 'IMPORT', job_mode => 'SCHEMA',...

Amazon Relational Database Service Extract Transform & Load Data Oracle

answers

votes

914

views

Jit

asked 3 months ago

DELETE FROM does not work on AWS Data Catalog table

Hello, While trying to run this command `DELETE FROM "datasets"."us_spending"` in Athena, on a table from AWS Data Catalog, I had this error: ``` NOT_SUPPORTED: Cannot delete from non-managed Hive...

Accepted AnswerAmazon Athena Extract Transform & Load Data

answers

votes

647

views

rePost-User-4717319

asked 3 months ago

AWS Data Catalog table index not working

Hello, For an AWS Data Catalog table, I ran Glue (structure: Amazon S3 -> Change Schema -> AWS Glue Data Catalog ) and populate table with only string records. All the actions were done from the...

Accepted AnswerAWS Glue Extract Transform & Load Data

answers

votes

141

views

rePost-User-4717319

asked 3 months ago

Handle de-dup in Glue Job Pyspark

Hello I am using PySpark on Glue Job to do ETL on a table sourced from S3 And S3 sourced from mysql via DMS (table schema as below, column 'op', 'row_updated_timestamp' & 'row_commit_timestamp' are...

AWS Glue Extract Transform & Load Data

answers

votes

110

views

rePost-User-1943247

asked 3 months ago

Glue Visual ETL: Can't copy raw data from RDS MySQL to S3 bucket due to unclassified error: Schema specified that header line is to be written; but contains no column names

I'm trying to build an ETL pipeline with AWS Glue, and the first step is to copy raw data from the original source to a staging bucket. The job is rather simple: source is a data catalog table (from...

Accepted AnswerAWS Glue Extract Transform & Load Data

answers

votes

222

views

NLopeDeBarrios

asked 3 months ago

Glue ETL AccessDeniedException for not existent Lake Formation

Hello, In a Glue ETL made of nodes: Amazon S3, Change Schema, AWS Glue Data Catalog with the table "us_spending" backed by S3, I have the following error: > Error Category: PERMISSION_ERROR;...

Accepted AnswerAWS Glue Extract Transform & Load Data AWS Lake Formation

answers

votes

184

views

rePost-User-4717319

asked 3 months ago

Pass parameter from one glue job to another in step function?

I am looking for the best way to pass a parameter from one glue job to another within a step function. Each day, I will receive a file. In the file there will be data for certain dates. The first...

AWS Step Functions AWS Glue Extract Transform & Load Data

answers

votes

610

views

rpost

asked 3 months ago

Getting Error Category: UNCLASSIFIED_ERROR; An error occurred while calling o107.pyWriteDynamicFrame. Exception thrown in awaitResult: when running Glue job to transfer data from S3 to Redshift

Hi. I am trying to run an AWS Glue job where I transfer data from S3 to Amazon Redshift. However, I am receiving the following error: ``` Error Category: UNCLASSIFIED_ERROR; An error occurred while...

AWS Glue Extract Transform & Load Data Amazon Redshift

answers

votes

805

views

Matt_J

asked 3 months ago

is it possible to converta spark dataframe to dynamic frame and then using bookmark feature on the s3 folder used to read data in spark frame

``` df = spark.read.parquet("s3://folder/") df = df.withColumn('filename', input_file_name()) AmazonS3_node1697616892615 = DynamicFrame.fromDF(df, glueContext, "s3sparkread") ``` if this is the code...

Amazon Simple Storage Service AWS Glue Extract Transform & Load Data

answers

votes

272

views

asked 4 months ago

Circular import error while importing oracledb in AWS Glue

I'm trying to achieve data change capture using AWS Glue and don't want to use DMS. I'm trying to transfer data between two Oracle RDS instances which are in different AWS Account. Here I am trying to...

Amazon Simple Storage Service Analytics AWS Glue Extract Transform & Load Data

answers

votes

456

views

Abhijeet

asked 4 months ago