All Content tagged with AWS Glue
AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.
Content language: English
Filter content
Select tags to filter
Sort by
Sort by most recent
2170 results
Currently on Glue 5.0 (us-west-2 region) Python.
I do want to add new partitions without having to create a new schema version - following a documented way of having updateBehavior set to LOG as belo...
2
answers
0
votes
80
views
asked 5 months ago
I've created a Glue Crawler to determine data structure from an XML file uploaded to s3, and write Table into Data Catalog.
I have created a custom Glue XML classifier. It has been working very well....
Accepted AnswerAWS Glue
1
answers
0
votes
92
views
asked 5 months ago
Hi,
We would like to understand whether it is possible to enforce Tagging through SCP during S3 Bucket Creations, Dynamo DB Creation, AWS SageMaker AI NoteBook Instance Creation, AWS Glue Creation, A...
3
answers
0
votes
210
views
asked 6 months ago
Hi everyone,
I’m trying to create a Redshift connection inside AWS Glue (Glue 5.0), but I always get this error:
AccessDeniedException: AccessDeniedException when creating service linked secrets, insu...
1
answers
0
votes
254
views
asked 6 months ago
I have registered a Redshift Serverless namespace to AWS Glue Data Catalog via Lake Formation and am trying to query the tables using Amazon Athena, but I'm getting an error: "Queries of this type are...
1
answers
0
votes
224
views
asked 6 months ago
I’m unable to create an AWS Glue crawler in my AWS account, and the error suggests the problem may be related to account-level Glue provisioning, not IAM.
Environment
Region: us-east-1
Not part of an...
1
answers
0
votes
269
views
asked 6 months ago
I have a s3 bucket with nested partition, first is the account_id and another is dt=yyyy-mm-dd-hh-mm, of 10 min interval, for a day there can be 24*6 partitions. There can be missing partition in betw...
1
answers
0
votes
1.1K
views
asked 6 months ago
How can I clone, or make a copy, or a Workflow?
I can't seem to find any docs around this, and my attempts with the [CLI](https://docs.aws.amazon.com/cli/latest/reference/glue/create-workflow.html) a...
1
answers
0
votes
124
views
asked 6 months ago
I have a Workflow with one Trigger. I assign two Jobs to the Trigger, and then realise one of the Jobs was added in error.
The [docs](https://docs.aws.amazon.com/glue/latest/dg/creating_running_workf...
Accepted AnswerAWS Glue
2
answers
0
votes
126
views
asked 6 months ago
Hello,
We are interested in using the Table Optimizer feature provided by AWS Glue, but we noticed that its not available in us-west-1 where our buckets and resources are present. Relevant AWS docume...
1
answers
0
votes
97
views
asked 6 months ago
I have written a custom script to transform a column, data transformed successfully and I can verify that in data preview in aws glue studio, acft_tail_num -> 8911(earlier this was N8911 ) when I joi...
2
answers
0
votes
87
views
asked 6 months ago
Spark workers generate a lot of logs and most of the information is not required on day-to-day basic. I would like to pay less for logs pushed and at the same time to have control over the logs verbos...
1
answers
0
votes
244
views
asked 6 months ago