Skip to content

Questions tagged with AWS Glue

AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.

Content language: English

Filter questions
Select tags to filter
Sort by
Sort by most recent
Filter Questions by:

Browse through the questions and answers listed below or filter and sort to narrow down your results.

2045 results
Hi, in AWS docs it is said that data that Glue jobs use is encrypted via KMS keys when it is at rest and in transit, which provides ambiguity since Glue jobs might use S3 instead of their local disk. ...
1
answers
0
votes
22
views
asked 4 days ago
While using **AWS DataZone Catalog**, I observed an issue with **glossary term navigation from assets**. ### **Observed behavior** * Assets (e.g., Glue Tables) show associated **Glossary Terms** as ...
1
answers
0
votes
27
views
asked 9 days ago
I have an IAM role for my bucket that has the following permissions: "s3:PutObject", "s3:GetObject", "iam:PassRole", "kms:Decrypt", "kms:Encrypt", "kms:GenerateDataKey", "s3:ListBucket", "s3:DeleteObj...
1
answers
0
votes
40
views
asked 10 days ago
I am attempting to use Hudi 1.1.1 in an AWS 5.0 Glue job. To avoid conflicts with the pre-installed Glue Hudi libraries, I have set the job parameter `--datalake-formats` to an empty string (""). Howe...
2
answers
0
votes
63
views
asked 13 days ago
Looking to implement sort compaction on an S3 Tables Iceberg table following the blog post: > [https://aws.amazon.com/blogs/aws/new-improve-apache-iceberg-query-performance-in-amazon-s3-with-sort-and...
4
answers
0
votes
104
views
asked 19 days ago
I'm using Amazon Kinesis Data Firehose to deliver streaming data into an Apache Iceberg table registered in the AWS Glue Data Catalog, which is part of a non-default catalog (s3tablescatalog/pulse-pro...
1
answers
0
votes
79
views
asked 24 days ago
I am building a Lakehouse solution using aws glue visual etl. When writing the dataset using the target s3 node in visual editor, there is no option to specify writemode() to overwrite When i checked ...
1
answers
0
votes
80
views
asked 25 days ago
We have a large-scale S3 data lake with the following characteristics: - Source: AWS Flink application writing Parquet files directly to S3 - Volume: ~4000 Parquet files per hour, ranging from 200GB ...
1
answers
0
votes
69
views
asked a month ago
We have glue crawler that runs on a S3 bucket. The bucket contains a single file only. This file get replaced with new data daily. Currently we run glue crawler everytime when this data. After which w...
2
answers
0
votes
26
views
asked a month ago
I’m running into an issue with AWS Clean Rooms when associating an AWS Glue table. The table association succeeds, but when another member of the collaboration tries to query it, they get: "Status not...
2
answers
0
votes
55
views
asked a month ago
This is a rant, not a question. There have been several posts about Glue reporting "Account is denied access". The recommendation in them was to contact AWS Support. One post clarified that in absence...
1
answers
-1
votes
42
views
asked a month ago
I'm encountering a CodeWhisperer: CodeWhisperer isn't enabled error in my AWS Glue interactive notebook, even after adding the necessary IAM permissions. Environment: - AWS Glue version: 5.0 - Regio...
2
answers
0
votes
30
views
asked a month ago
  • 1
  • 2
  • 3
  • 4
  • 5
  • •••
  • 171
  • Page size
    12 / page