Questions tagged with AWS Glue
Crawler Error:
Insufficient Lake Formation permission(s) on mock_data_patient (Database name: crawl_db, Table Name: mock_data_patient) (Service: AWSGlue; Status Code: 400; Error Code:...
1 answer · 0 votes · 166 views · asked 3 months ago
I'm trying to build an ETL pipeline with AWS Glue, and the first step is to copy raw data from the original source to a staging bucket. The job is rather simple: the source is a Data Catalog table (from...
1 answer · 0 votes · 221 views · asked 3 months ago
Hello,
In a Glue ETL job made up of the nodes Amazon S3, Change Schema, and AWS Glue Data Catalog (with the table "us_spending" backed by S3), I get the following error:
> Error Category: PERMISSION_ERROR;...
1 answer · 0 votes · 184 views · asked 3 months ago
I am looking for the best way to pass a parameter from one Glue job to another within a Step Function.
Each day, I will receive a file. In the file there will be data for certain dates. The first...
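One common pattern for this: have the Step Functions state machine carry the parameter in its execution state and pass it to each `glue:startJobRun` task via `Arguments`. A minimal sketch of the state machine (state names, job names, and the `process_date` field are hypothetical):

```json
{
  "Comment": "Pass a date parameter to consecutive Glue jobs (hypothetical names)",
  "StartAt": "ExtractJob",
  "States": {
    "ExtractJob": {
      "Type": "Task",
      "Resource": "arn:aws:states:::glue:startJobRun.sync",
      "Parameters": {
        "JobName": "extract-job",
        "Arguments": { "--process_date.$": "$.process_date" }
      },
      "ResultPath": "$.extractResult",
      "Next": "LoadJob"
    },
    "LoadJob": {
      "Type": "Task",
      "Resource": "arn:aws:states:::glue:startJobRun.sync",
      "Parameters": {
        "JobName": "load-job",
        "Arguments": { "--process_date.$": "$.process_date" }
      },
      "End": true
    }
  }
}
```

Note that a Glue job run does not return a data payload to Step Functions, so a value computed *inside* the first job (e.g., dates read from the file) can't be passed back directly through the task result; it has to come from the execution input or be written somewhere both jobs can read, such as S3 or SSM Parameter Store. Inside each job, the argument is read with `getResolvedOptions`.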
1 answer · 0 votes · 605 views · asked 3 months ago
We have a use case where we want to export ~500 TB of DynamoDB data to S3; one of the possible approaches I found was to use an AWS Glue job.
Also, while exporting the data to S3, we need to...
2 answers · 0 votes · 242 views · asked 3 months ago
I'm having an issue trying to set up a custom query in Glue Studio for BigQuery. For example, the query below works in BigQuery, but doesn't work as a custom query in Glue Studio.
```
SELECT * FROM...
```
0 answers · 0 votes · 54 views · asked 3 months ago
I need to load data from my DataFrame into a BigQuery table's JSON-type field.
There is a connector that, according to its documentation, supports this feature:...
0 answers · 0 votes · 46 views · asked 3 months ago
I'm working on a project that makes use of Glue Record Matching transforms, which, by my best research through the AWS docs, are only supported in Glue 2.0 jobs (and additionally, the maximum Glue version I...
0 answers · 0 votes · 50 views · asked 3 months ago
I am trying to write a PySpark DataFrame to S3 and the AWS Glue Data Catalog using the Iceberg format and pyspark.sql.DataFrameWriterV2 with the createOrReplace function. When I write the same...
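A frequent cause of DataFrameWriterV2 failures against the Glue Data Catalog is that the session has no Iceberg-aware catalog registered: `createOrReplace` resolves the table through Spark's catalog API, so the Glue catalog must be configured as an Iceberg SparkCatalog. A sketch of the Spark configuration (the catalog name `glue_catalog` and the warehouse path are assumptions):

```
spark.sql.catalog.glue_catalog               org.apache.iceberg.spark.SparkCatalog
spark.sql.catalog.glue_catalog.catalog-impl  org.apache.iceberg.aws.glue.GlueCatalog
spark.sql.catalog.glue_catalog.warehouse     s3://my-bucket/warehouse/
spark.sql.catalog.glue_catalog.io-impl       org.apache.iceberg.aws.s3.S3FileIO
spark.sql.extensions                         org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions
```

With that in place, writes are addressed through the catalog name, e.g. `df.writeTo("glue_catalog.my_db.my_table").using("iceberg").createOrReplace()`. In Glue 3.0+ jobs, the built-in Iceberg support is enabled with the `--datalake-formats iceberg` job parameter.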
1 answer · 0 votes · 573 views · asked 3 months ago
Hi. I am trying to run an AWS Glue job where I transfer data from S3 to Amazon Redshift. However, I am receiving the following error:
```
Error Category: UNCLASSIFIED_ERROR; An error occurred while...
```
2 answers · 0 votes · 802 views · asked 3 months ago
I have a data pipeline built in Redshift Serverless, with some final tables as the result. We are also running a web app, which I have set up to run from an Aurora Serverless PostgreSQL DB. The idea...
0 answers · 0 votes · 106 views · asked 3 months ago
Can someone please help with this error? I have a CSV file in an S3 bucket and created a crawler to update a table in Glue; the crawler runs, but when I try to view the data in Athena I get this...
1 answer · 0 votes · 532 views · asked 3 months ago