Questions tagged with Extract Transform & Load Data

Content language: English

Select up to 5 tags to filter

Sort by most recent

Filter Questions by

AllAnsweredUnansweredNo Answer

Browse through the questions and answers listed below or filter and sort to narrow down your results.

Only first record from JSON array in s3 is being retrieved from Athena

I'm having the same issue. Data is stored in below format in s3 as JSON array with partitions S3 path - s3://fleet-fuelcard-data-import-dev/lambda/fuelsoft-morgan/660306/2024/Apr/03-Apr-2024.json....

Amazon Athena Analytics Extract Transform & Load Data S3 Select

answers

votes

views

ShaXXaws

asked 3 days ago

Does AWS Glue Support Struct and Array Data Types in Schema Changes?

In AWS Glue jobs, within the Targets node, I am unable to see the data types such as struct, array or map while changing the schema. Does AWS Glue not support these data types?

AWS Data Pipeline AWS Glue Extract Transform & Load Data

answers

votes

134

views

saulgoodman

asked 5 days ago

Error 'IllegalArgumentException: No group with name <host>' in AWS Glue ETL Job from RDS to Snowflake

I've successfully set up AWS Glue with an RDS database serving as the data source and a Snowflake database as the data target. In this setup, I've configured AWS Glue crawlers to catalog the metadata...

Analytics Database AWS Glue Extract Transform & Load Data

answers

votes

158

views

asked 6 days ago

Export Error with Oracle RDS

Hi, When I try to export (expdp) a database schema in Oracle RDS using SQL Developer, I get the error below related to dump file size. This seems to be caused by default FILESIZE parameter in AWS...

Accepted AnswerAmazon Relational Database Service AWS Management Console Extract Transform & Load Data Oracle RDS Custom for SQL Oracle

answers

votes

247

views

sam15

asked 8 days ago

Data Inconsistency in Datalake: Glue Job Bookmark Issue

Hello, I am relatively new to Glue and encountering some challenges with Glue ETL. Our setup involves a datalake that retrieves data from a backend database as its source. This datalake is...

Amazon Athena Analytics Database AWS Glue Extract Transform & Load Data

answers

votes

200

views

Philip

asked 11 days ago

Enforce column type in Glue Crawler

Hello, I have parquets files in S3 that i parse using Glue Crawler and query in Athena. I found that some files have two columns "x" and "y" that have a type **int64** while other files have them as...

Amazon Athena AWS Glue Extract Transform & Load Data

answers

votes

197

views

Mehdi

asked 12 days ago

Glue - Change Schema - Dropdown for target key or custom visual transform

In our ETL process we are building out a pipeline where someones job is to take input files (ex. csv) and map the columns to existing column names. After the mapping is complete a glue workflow will...

AWS Glue Extract Transform & Load Data

answers

votes

169

views

aws_explorer

asked 13 days ago

Read the input file name from S3 in AWS Glue into redshift

I am reading multiple files from S3 and writing the output to Redshift DB. Below is my code to read all the files from a S3 location (s3://abc/oms/YFS_CATEGORY_ITEM/) ``` yfs_category_item_df =...

Amazon Simple Storage Service Analytics AWS Glue Extract Transform & Load Data Amazon Redshift

answers

votes

475

views

Joe

asked 18 days ago

Glue DynamoDB Writer - how to access unprocessed items?

We have a glue job that is writing large number of items to dynamo. **If a write to dynamo fails, how can we have access to these individual failed records in order to attempt to resolve and...

AWS Glue Amazon DynamoDB Extract Transform & Load Data

answers

votes

272

views

struesda

asked 20 days ago

Querying LZ4 compressed file on s3 using AWS Athena

Hi I have created an external table on AWS Glue catalog db . The table points to a lz4 compressed file on an s3. the table definition looks like this ``` CREATE EXTERNAL TABLE `myapplogs`( ...

Amazon Athena Analytics AWS Glue Extract Transform & Load Data

answers

votes

276

views

Pradeep

asked 20 days ago

Why doesn't Glue Job and Glue Workflow have the function of version control and alias likes Labmda.

I tried to develop the data orchestlation with s3, Glue Job and Glue Workflow. After I developed it, I found that Glue Job and Glue Workflow doesn't have the function of version control and alias...

AWS Glue Extract Transform & Load Data

answers

votes

169

views

sympa shun

asked 24 days ago

Best practices for data ingestion - csv or mongodb

Hi team, first post, let me know if it provides a good explanation. I'd like to know a way to minimize the effort for data ingestion. We have two options as follows: (1) csv files from a file...

Analytics AWS Lambda AWS Glue Extract Transform & Load Data

answers

votes

296

views

Felipe Vaz

asked 24 days ago

1
2
3
4
5
•••
52
12 / page