Questions tagged with Extract Transform & Load Data
Browse through the questions and answers listed below or filter and sort to narrow down your results.
When I started an ETL job, I mapped one table to an S3 bucket and changed some data types. Two columns came out empty because they contained null values. How can I skip the null values in...
0 answers · 0 votes · 60 views · asked a year ago
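One possible approach to the question above, as a minimal sketch: drop the records with nulls before the type change. `price` and `qty` are placeholder column names, and in a real Glue script the predicate would be handed to `awsglue.transforms.Filter` (which passes dict-like records).

```python
# Sketch only: `price` and `qty` stand in for the two affected columns.
def non_null(record, fields=("price", "qty")):
    """Predicate for Glue's Filter transform: keep a record only when
    every listed field is present and not null."""
    return all(record.get(f) is not None for f in fields)

# In the Glue script itself (not runnable outside Glue) it would be applied
# before the type mapping, e.g.:
#   from awsglue.transforms import Filter, ApplyMapping
#   clean = Filter.apply(frame=dyf, f=non_null)
#   mapped = ApplyMapping.apply(frame=clean,
#                               mappings=[("price", "string", "price", "double")])
```

Whether dropping is the right answer depends on the data; the alternative is to fill a default value before casting.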
I converted a CSV (from S3) to Parquet (to S3) using AWS Glue, and the resulting Parquet file was named randomly. How do I choose the name of the file that is converted to Parquet from...
1 answer · 0 votes · 784 views · asked a year ago
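Spark-based Glue writes always emit `part-....parquet` names and offer no setting to pick the file name at write time; a common workaround is to copy the object to the desired key afterwards. A sketch, assuming a single output file under a known prefix (bucket, prefix, and key are placeholders; S3 has no rename, so this is copy-then-delete):

```python
def rename_single_output(bucket, prefix, target_key, s3=None):
    """Copy the newest .parquet object under `prefix` to `target_key`,
    then delete the original (S3 has no rename operation)."""
    if s3 is None:
        import boto3  # deferred so the helper can be exercised with a stub client
        s3 = boto3.client("s3")
    objs = s3.list_objects_v2(Bucket=bucket, Prefix=prefix).get("Contents", [])
    parts = [o for o in objs if o["Key"].endswith(".parquet")]
    newest = max(parts, key=lambda o: o["LastModified"])
    s3.copy_object(Bucket=bucket,
                   CopySource={"Bucket": bucket, "Key": newest["Key"]},
                   Key=target_key)
    s3.delete_object(Bucket=bucket, Key=newest["Key"])
    return newest["Key"]
```

To guarantee there is only one output file to rename, repartition the frame to a single partition (e.g. `coalesce(1)` on the underlying DataFrame) before writing.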
Macie provides detailed positions of sensitive data in its output file, and I want to extract that data using those positions. Also, Macie reveals only 10 samples.
Is there any way to get more...
1 answer · 0 votes · 329 views · asked a year ago
I'm writing partitioned parquet data using a Spark data frame and mode=overwrite to update stale partitions. I have this set: spark.conf.set('spark.sql.sources.partitionOverwriteMode','dynamic')
The...
1 answer · 0 votes · 874 views · asked a year ago
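For reference, the usual shape of a dynamic partition overwrite is below. This is a fragment, not runnable outside a Spark session; the partition column `dt` and the S3 path are placeholders. With `partitionOverwriteMode=dynamic` in effect on the writing session, `mode("overwrite")` replaces only the partitions present in the incoming DataFrame rather than truncating the whole table path.

```python
# Fragment: assumes an existing SparkSession `spark` and DataFrame `df`.
spark.conf.set("spark.sql.sources.partitionOverwriteMode", "dynamic")

(df.write
   .mode("overwrite")
   .partitionBy("dt")                   # partition column is a placeholder
   .parquet("s3://my-bucket/table/"))   # path is a placeholder
```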
How can one set Execution Class = FLEX on a Jupyter job run? I'm using the magic in my %%configure cell as below, and also setting the input argument --execution_class = FLEX.
But still...
2 answers · 0 votes · 606 views · asked a year ago
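One way to isolate the problem: FLEX is a parameter of the Glue `StartJobRun` API (`ExecutionClass`), so starting the job directly shows whether the job itself accepts FLEX, independent of the notebook magics. A sketch (the job name is a placeholder, and whether notebook-driven runs honor the setting is worth confirming against the Glue docs):

```python
def start_flex_run(job_name, arguments=None, glue=None):
    """Start a Glue job run on the FLEX execution class via StartJobRun."""
    if glue is None:
        import boto3  # deferred so the helper can be exercised with a stub client
        glue = boto3.client("glue")
    params = {"JobName": job_name, "ExecutionClass": "FLEX"}
    if arguments:
        params["Arguments"] = arguments
    return glue.start_job_run(**params)["JobRunId"]
```

If this run shows up as FLEX in the console but the notebook-launched one does not, the issue is in how the session forwards the setting, not in the job definition.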
Hi, I'd appreciate AWS Athena support for TIMESTAMP data type with microsecond precision for all row formats and table engines. Currently, the support is very inconsistent. See the SQL script below....
0 answers · 0 votes · 155 views · asked a year ago
Started getting this error today when querying data from Athena in a table created from parquet files in our S3 bucket:
[image: screenshot of the Athena error message]
0 answers · 0 votes · 100 views · asked a year ago
Hi community,
I am trying to perform an ETL job using AWS Glue.
Our data is stored in MongoDB Atlas, inside a VPC.
Our AWS is connected to our MongoDB Atlas using VPC peering.
To perform the ETL...
1 answer · 1 vote · 455 views · asked a year ago
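A sketch of the connection-options half of such a job, assuming Glue's built-in MongoDB connector. The key names follow the Glue connection-options documentation but should be verified against your Glue version, and the URI/database/collection values are placeholders:

```python
def mongo_options(uri, database, collection, username, password):
    """Assemble connection_options for a Glue read with
    connection_type="mongodb" (key names per the Glue docs; verify
    against your Glue version)."""
    return {
        "uri": uri,            # e.g. the Atlas connection URI (placeholder)
        "database": database,
        "collection": collection,
        "username": username,
        "password": password,
    }

# Inside the Glue script (not runnable outside Glue):
#   dyf = glueContext.create_dynamic_frame.from_options(
#       connection_type="mongodb",
#       connection_options=mongo_options(uri, db, coll, user, pwd))
```

With VPC peering, the Glue connection must also be placed in a subnet whose route table reaches the peered Atlas VPC, and the security group must allow the MongoDB port.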
In Redshift, I'm trying to update a table using another table from another database. The error details:
SQL Error [XX000]: ERROR: Assert
Detail:
-----------------------------------------------
...
1 answer · 0 votes · 242 views · asked a year ago
I'm attempting to use AWS Data Pipeline to move a CSV file from my computer to an AWS data lake as a Parquet file. I'm unable to find the exact template to select to migrate from my local...
0 answers · 0 votes · 85 views · asked a year ago
I want to move a CSV file directly from my laptop to an AWS data lake using AWS Data Pipeline.
Is it possible to do so? If yes, how?
1 answer · 0 votes · 336 views · asked a year ago
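On both of the questions above: Data Pipeline activities run on AWS-managed resources, so short of running Task Runner on the laptop there is no data node that reads a local disk. The usual pattern is to upload the CSV to S3 first (CLI `aws s3 cp`, or boto3 as below) and let the pipeline or a Glue job do the Parquet conversion from there. A sketch of the upload step (bucket and key are placeholders):

```python
def upload_csv(local_path, bucket, key, s3=None):
    """Push a local CSV into the S3 landing area of the data lake
    and return its s3:// URI for the downstream conversion step."""
    if s3 is None:
        import boto3  # deferred so the helper can be exercised with a stub client
        s3 = boto3.client("s3")
    s3.upload_file(local_path, bucket, key)
    return f"s3://{bucket}/{key}"
```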
Hey,
My ETL Glue job is:
1. reading from the Data Catalog (S3-based),
2. selecting specific fields from the input file (which is JSON),
3. doing some mapping,
4. saving output data to Postgres Aurora...
2 answers · 0 votes · 342 views · asked a year ago
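For the field-selection and mapping steps of a job like the one above, a small helper sketch for building the 4-tuples that `ApplyMapping.apply` expects (field names and types are placeholders; the Aurora write itself would go through a Glue JDBC or catalog connection):

```python
def mappings_from(spec):
    """Turn {"src_field": ("src_type", "dst_field", "dst_type")} into the
    list of 4-tuples that awsglue.transforms.ApplyMapping.apply takes."""
    return [(src, st, dst, dt) for src, (st, dst, dt) in spec.items()]

# In the Glue script (not runnable outside Glue):
#   mapped = ApplyMapping.apply(frame=dyf, mappings=mappings_from({
#       "id":   ("string", "id",   "int"),
#       "name": ("string", "name", "string"),
#   }))
```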