Questions tagged with AWS Glue
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
I'm running an EMR Serverless Spark job that uses Delta OSS to handle Delta tables. I previously resolved a configuration issue with EMR Serverless and AWS Glue Data Catalog...
1
answers
0
votes
219
views
asked 11 days agolg...
Why doesn't Glue Job and Glue Workflow have the function of version control and alias likes Labmda.lg...
I tried to develop the data orchestlation with s3, Glue Job and Glue Workflow. After I developed it, I found that Glue Job and Glue Workflow doesn't have the function of version control and alias...
0
answers
0
votes
57
views
asked 11 days agolg...
Hi team, first post, let me know if it provides a good explanation.
I'd like to know a way to minimize the effort for data ingestion.
We have two options as follows:
(1) csv files from a file...
0
answers
0
votes
185
views
asked 12 days agolg...
I get the error "InvalidInputException: Unable to resolve any valid connection" when I test my AWS Glue connection to my mongDB Atlas database.
I can connect with an identical string, user and...
1
answers
0
votes
47
views
asked 13 days agolg...
Kinesis Firehose allows to configure S3 as a destination and in the Parquet section allows selecting a Glue catalog table of format Iceberg. However I had little to no luck querying the data.
Does...
1
answers
0
votes
63
views
asked 13 days agolg...
I am trying to connect AWS glue crawler with a postgres db on RDS. Both the crawler and DB are in the same region. steps followed:
1. created a connection with jdbc url username and...
1
answers
0
votes
76
views
asked 13 days agolg...
Hi all,
I have shared a Glue table (S3) with another account where I can already query it via Athena.
Now I added LakeFormation permissions for the database and table to the role that I am using...
1
answers
0
votes
102
views
asked 13 days agolg...
I want to use AWS Glue Data Catalog as a metastore. I'm running an EMR Serverless job that inserts and updates data in a Delta Table. I've successfully populated Delta tables on my localhost...
2
answers
0
votes
101
views
asked 13 days agolg...
Hi, I'm implementing a case where either one column can be null, but not both in the same record. And implementing rule
(ColumnValues "col_1" = NULL) or (ColumnValues "col_2" = NULL)
I'm seeing below...
1
answers
0
votes
43
views
asked 14 days agolg...
Glue Job - S3 to S3lg...
Hi Team,
I am working on Glue to job to copy/move file from one bucket to another bucket. Could you please help me with your thoughts
1. Using Python how to copy/move the unzipped file to target...
0
answers
0
votes
66
views
asked 14 days agolg...
Following this post: https://repost.aws/knowledge-center/glue-reduce-cloudwatch-logs
I have created the following glue job:
```
from awsglue.context import GlueContext
from pyspark.context import...
1
answers
0
votes
65
views
asked 14 days agolg...
Hi,
I am testing Amazon DataZone features and therefore set up a domain together with another associated account.
I enabled the DataLake blueprint in both accounts. I have 2 projects (producer,...
2
answers
0
votes
80
views
asked 15 days agolg...