Unanswered Questions tagged with AWS Glue
I am trying to use the AWS DynamoDB export to S3, but when I read the data in Glue, the value of an entire column is being received as null.
I have tried doing the same thing multiple times. And I ran a...
0 answers, 0 votes, 102 views, asked 9 months ago
[CDK] Create a Glue Trigger that triggers Glue Crawler after Glue Job is finished successfully.
I want to build a Glue Trigger that triggers Glue Crawler after Glue Job is finished successfully.
I looked over CfnTrigger and wrote code for it.
After CDK DEPLOY and finishing Glue Job...
0 answers, 0 votes, 103 views, asked 9 months ago
I'm using Athena on an S3 bucket that's been crawled and get the error:
class org.apache.parquet.io.GroupColumnIO cannot be cast to class org.apache.parquet.io.PrimitiveColumnIO
I've narrowed down the...
0 answers, 0 votes, 81 views, asked 9 months ago
Hi Team,
I am trying to archive MongoDB data to S3 in Parquet format, so I created a Spark script for that. When I execute the Spark script, I get the error below. How to resolve this...
0 answers, 0 votes, 121 views, asked 9 months ago
Hi all,
I've noticed some limitations while using Glue Workflows, and I'd like to suggest improvements or hear whether there are alternatives.
1) Suppose you have job C depending on both jobs A and B...
0 answers, 0 votes, 219 views, asked 9 months ago
I have a Glue job that reads from a raw bucket and writes to a discovery bucket. In this process I'm facing an error: the job is not able to process the files present in the raw bucket location (from logs...
0 answers, 0 votes, 74 views, asked 9 months ago
I'm trying to update an existing AWS Glue Crawler for a DocumentDB instance. Given that it won't take a wildcard to add all the collections to the crawler, I'm looking for an easy way to add several...
0 answers, 0 votes, 41 views, asked 9 months ago
Hello, we have an S3 bucket with various CSV files, an AWS Glue crawler to update the Data Catalog, and finally an AWS Glue job to move the data to Redshift. The handling of data and target table is...
0 answers, 0 votes, 130 views, asked 9 months ago
I've got Athena set up to query a DocumentDB instance, with the Lambda function built and AWS Glue configured. The setup was done through the data source connector for DocumentDB.
I can see the database...
0 answers, 0 votes, 183 views, asked 10 months ago
Hello everyone.
Data from a REST API, in JSON form, is loaded daily by Lambda into s3-bucket-1.
This data should then be stored in s3-bucket-2 as a flat Parquet table.
I did it in...
0 answers, 0 votes, 78 views, asked 10 months ago
Hi,
Has anyone managed to make the Delta-Spark Python package work with a Glue interactive session? It works when I submit a Glue 4.0 job but not in an interactive session. Below are a few code snippets that...
0 answers, 0 votes, 53 views, asked 10 months ago
Hi Team,
I am using the code below and it's giving me the columns from the data, but my expectation is to get the columns from the Glue Data Catalog.
glueContext.create_dynamic_frame.from_catalog(database =...
0 answers, 0 votes, 94 views, asked 10 months ago