Questions tagged with Extract Transform & Load Data

Content language: English

Select up to 5 tags to filter

Sort by most recent

Filter Questions by

AllAnsweredUnansweredNo Answer

Browse through the questions and answers listed below or filter and sort to narrow down your results.

Data Wrangler PCA Error Py4JJavaError: breeze.linalg.NotConvergedException

I added PCA to my flow, I chose the numeric values, added a value for components and clicked preview. This error pops up: OperatorCustomerError: Py4JJavaError: breeze.linalg.NotConvergedException: I...

Extract Transform & Load Data Amazon SageMaker Data Wrangler

answers

votes

views

RobotimusPrime

asked a year ago

Finding the right big data solution

I work for a company that generates large amounts of business and sensor data and stores it in different databases (Prometheus, InfluxDB, Postgres, Timestream). Currently the querying for analytics...

Analytics Database AWS Glue Extract Transform & Load Data Amazon Redshift

answers

votes

373

views

D Joe

asked a year ago

AWS Glue Notebook issue when running SQL script

I am following the steps outlined in the link below: https://aws.amazon.com/blogs/big-data/introducing-native-delta-lake-table-support-with-aws-glue-crawlers/ (1) No issue with Query Delta Lake...

Accepted AnswerAWS Glue Extract Transform & Load Data

answers

votes

409

views

rePost-User-3769185

asked a year ago

Does Glue Crawler or catalog tables have 50 columns max limit?

I try to use Glue Crawler to read CSV files from S3 and create catalog table from it. Crawler run succesfully and it will create catalog table but those tables are empty (without columns) if I have...

AWS Glue Extract Transform & Load Data

answers

votes

974

views

rePost-User-1691104

asked a year ago

how to skip null value when mapping in aws glue etl job

When I started an etl job and mapped one table to a s3 bucket and change some data type. I got two columns empty because these two columns included some null value, how can I skip the null value in...

Extract Transform & Load Data

answers

votes

views

rePost-User-1836161

asked a year ago

CSV to Parquet using AWS Glue

I converted a CSV(from S3) to parquet(to S3) using AWS glue and the file which is converted to Parquet was named randomly .How do i choose the name of the file that is to be converted to Parquet from...

Amazon Simple Storage Service AWS Storage Gateway Amazon Athena AWS Glue Extract Transform & Load Data

answers

votes

762

views

rePost-User-1675689

asked a year ago

AWS macie reveal samples

Macie provides detailed positions of sensitive data in output file. But, I want to extract that data using positions from output file. Also, macie reveal only 10 samples. Is there any way to get more...

Amazon Athena Amazon Macie Amazon QuickSight Storage Extract Transform & Load Data

answers

votes

316

views

rePost-User-0027992

asked a year ago

pyspark dataframe in glue notebook mode=overwrite creates empty files at each path level

I'm writing partitioned parquet data using a Spark data frame and mode=overwrite to update stale partitions. I have this set: spark.conf.set('spark.sql.sources.partitionOverwriteMode','dynamic') The...

AWS Glue Extract Transform & Load Data

answers

votes

845

views

rePost-User-6463058

asked a year ago

Glue Jupyter Job , Execution Class

How can one set up an Execution Class = FLEX on a Jupyter Job Run , im using the %magic on my %%configure cell like below and also setting the input arguments with --execution_class = FLEX But still...

Analytics AWS Glue Extract Transform & Load Data

answers

votes

585

views

Jorge Vidinha

asked a year ago

AWS Athena support for microseconds in the TIMESTAMP data type

Hi, I'd appreciate AWS Athena support for TIMESTAMP data type with microsecond precision for all row formats and table engines. Currently, the support is very inconsistent. See the SQL script below....

Amazon Athena Analytics Database Extract Transform & Load Data

answers

votes

143

views

zsvoboda

asked a year ago

Help with HIVE_BAD_DATA: No columns in row group error

Started getting this error today when querying data from Athena in a table created from parquet files in our S3 bucket: ![Enter image description...

Amazon Athena Extract Transform & Load Data

answers

votes

views

rePost-User-6273709

asked a year ago

AWS Glue Mongodb Atlas Connection Works For Crawler But Not For ETL Job?

Hi community, I am trying to perform an ETL job using AWS Glue. Our data is stored in MongoDB Atlas, inside a VPC. Our AWS is connected to our MongoDB Atlas using VPC peering. To perform the ETL...

Accepted AnswerAnalytics Amazon VPC Database AWS Glue Extract Transform & Load Data

answers

votes

427

views

rePost-User-0675162

asked a year ago