Questions tagged with AWS Glue
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
I'm running an EMR Serverless Spark job that uses Delta OSS to handle Delta tables. I previously resolved a configuration issue with EMR Serverless and AWS Glue Data Catalog...
0
answers
0
votes
1
views
asked 11 minutes agolg...
Why doesn't Glue Job and Glue Workflow have the function of version control and alias likes Labmda.lg...
I tried to develop the data orchestlation with s3, Glue Job and Glue Workflow. After I developed it, I found that Glue Job and Glue Workflow doesn't have the function of version control and alias...
0
answers
0
votes
2
views
asked 15 minutes agolg...
Hi team, first post, let me know if it provides a good explanation.
I'd like to know a way to minimize the effort for data ingestion.
We have two options as follows:
(1) csv files from a file...
0
answers
0
votes
19
views
asked 8 hours agolg...
I get the error "InvalidInputException: Unable to resolve any valid connection" when I test my AWS Glue connection to my mongDB Atlas database.
I can connect with an identical string, user and...
1
answers
0
votes
22
views
asked a day agolg...
Kinesis Firehose allows to configure S3 as a destination and in the Parquet section allows selecting a Glue catalog table of format Iceberg. However I had little to no luck querying the data.
Does...
1
answers
0
votes
26
views
asked a day agolg...
I am trying to connect AWS glue crawler with a postgres db on RDS. Both the crawler and DB are in the same region. steps followed:
1. created a connection with jdbc url username and...
1
answers
0
votes
42
views
asked 2 days agolg...
Hi all,
I have shared a Glue table (S3) with another account where I can already query it via Athena.
Now I added LakeFormation permissions for the database and table to the role that I am using...
1
answers
0
votes
36
views
asked 2 days agolg...
I want to use AWS Glue Data Catalog as a metastore. I'm running an EMR Serverless job that inserts and updates data in a Delta Table. I've successfully populated Delta tables on my localhost...
2
answers
0
votes
35
views
asked 2 days agolg...
Hi, I'm implementing a case where either one column can be null, but not both in the same record. And implementing rule
(ColumnValues "col_1" = NULL) or (ColumnValues "col_2" = NULL)
I'm seeing below...
1
answers
0
votes
24
views
asked 2 days agolg...
Glue Job - S3 to S3lg...
Hi Team,
I am working on Glue to job to copy/move file from one bucket to another bucket. Could you please help me with your thoughts
1. Using Python how to copy/move the unzipped file to target...
0
answers
0
votes
40
views
asked 2 days agolg...
Following this post: https://repost.aws/knowledge-center/glue-reduce-cloudwatch-logs
I have created the following glue job:
```
from awsglue.context import GlueContext
from pyspark.context import...
1
answers
0
votes
49
views
asked 3 days agolg...
Hi,
I am testing Amazon DataZone features and therefore set up a domain together with another associated account.
I enabled the DataLake blueprint in both accounts. I have 2 projects (producer,...
2
answers
0
votes
65
views
asked 3 days agolg...