Questions tagged with Extract Transform & Load Data

Content language: English

Select up to 5 tags to filter

Sort by most recent

Filter Questions by

AllAnsweredUnansweredNo Answer

Browse through the questions and answers listed below or filter and sort to narrow down your results.

Reading S3 objects whose basename begins with an underscore as a Glue DynamicFrame

I have JSON data stored on S3 which I have created Glue tables over. This data is partitioned and I use Glue crawlers to update the table partitions. I then load this data as a Glue DynamicFrame...

Accepted AnswerAmazon Athena AWS Glue Extract Transform & Load Data

answers

votes

481

views

bridgedownstream

asked a year ago

AWS Glue PII detector job taking too much time

I have an AWS Glue PII data detector job, its taking around 47 minutes to complete for 17.9 MB file size which is very long time for any spark job. Sharing the code snippet used in the...

Accepted AnswerAWS Glue Extract Transform & Load Data

answers

votes

440

views

rePost-User-0309502

asked a year ago

HIVE_UNKNOWN_ERROR: Path is not absolute: s3://deproject-on-youtube-athena-job-output This query ran against the "db_youtube_cleaned" database, unless qualified by the query. Please post the error mes

I have ran the python query to transform the json format to parquet format and it was completed successfully, I can see the parquet file and the columns, but when I try to run the query using Athena,...

Amazon Athena AWS Lambda AWS Glue Extract Transform & Load Data MySQL

answers

votes

821

views

Sai Gopi Krishna

asked a year ago

New to AWS, trying to create a table through glue crawler from a .json file that i uploaded into S3.

Hello, any help would be much appreciated. I have two files that I need to make tables for one is a csv file that I was able to get the table loaded for through glue crawler. The other file i was not...

Amazon Athena AWS Glue Extract Transform & Load Data

answers

votes

1122

views

rePost-User-8932083

asked a year ago

Amazon Connect - For reports generated from saved templates, columns randomly show up in different order, have a different format, or have a leading space

We have a number of saved report templates and when we look at the generated csv files, there are a few problems. 1. the columns change order randomly 2. in the csv some of the column headers (such...

Amazon Connect Extract Transform & Load Data

answers

votes

261

views

ChrisConnect

asked a year ago

AWS Glue creating Null for Empty data

I am using Amazon Kinesis Firehose for converting files from JSON to Parquet leveraging Glue for Table creation. When the data is blank the glue schema creates a NULL and the conversion at Kinesis...

Analytics Extract Transform & Load Data Amazon Kinesis

answers

votes

2149

views

rePost-User-2987011

asked a year ago

EMR spark no module named

Hi, I'm trying to run a python job on EMR with some dependencies installed with venv as following ``` python -m venv pyspark_venv source pyspark_venv/bin/activate pip install pyarrow pandas...

Accepted AnswerAnalytics Linux Provisioning Amazon EMR AWS Batch Extract Transform & Load Data

answers

votes

876

views

Paolo

asked a year ago

ETL Influx2 with Glue

Hey all! Wondering if anybody has some experience what the best way is to ETL data from Influx2 to my AWS S3 Data lake. I have been looking for influx2 jdbc connectors (I have used these for PG) but...

Analytics AWS Glue Extract Transform & Load Data

answers

votes

views

D Joe

asked a year ago

AWS Glue writing to S3 but not creating table

Hey! I have a setup currently with a crawler that connects to a PostgreSQL database with JDBC, this works and the crawler generates around 20 tables for this database. I now want to create an ETL job...

Amazon Athena Analytics AWS Glue Extract Transform & Load Data

answers

votes

938

views

D Joe

asked a year ago

ETL using Aws Data Glue

I want to extract data from my PostgreSQL on RDS using Aws data glue, transform the data and export the data to s3 bucket. how do i do that. i need an AWS tutorial on this.

Amazon Simple Storage Service PostgreSQL AWS Glue Extract Transform & Load Data

answers

votes

1086

views

Adesoji

asked a year ago

Error when running a glue job to write data to data lake

I am writing a glue script to take data from s3(PSQL WAL LOGS) to write that data into a hudi data lake. Whenever I am trying to do that I am getting unable to upsert data with commit time error with...

Amazon Athena Analytics AWS Glue Extract Transform & Load Data

answers

votes

744

views

rePost-User-5701621

asked a year ago

How can I run a post action script while writing to redshift from aws glue?

``` df.toDF().write.format("jdbc").\ option("url", "").\ option("dbtable", f"public.{tableName}_staging").\ option("user", "").\ option("password", "").\ ...

AWS Glue Extract Transform & Load Data Amazon Redshift

answers

votes

353

views

gang-gang

asked a year ago