All Content tagged with AWS Glue

AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.

1818 results
**Issue:** Our aim is to reduce logging in order to control the data ingestion reported by the 'PutLogEvent' metrics of CloudWatch. In the past, when we ran our Glue job against 35 GB of data, we were billed ~2K for CloudWatch, most of...
1 answer, 0 votes, 80 views, asked 2 months ago
I have the Python script below, which currently generates several .gz files of 4 MB each in an S3 bucket. That is what AWS Glue creates by default. But now I want to create multiple files of each file size ...
2 answers, 0 votes, 79 views, asked 2 months ago
I have the Python script below in an AWS Glue job. For the incremental load logic, I have now set the job bookmark option to Enable. I then tried to run the Glue job again, but it did not create any temporary fi...
1 answer, 0 votes, 68 views, asked 2 months ago
We have data stored in Cosmos DB NoSQL and need to migrate it to Snowflake using AWS Glue with a Change Data Capture (CDC) approach. Our objective is to perform CRUD operations based on CDC to handle...
1 answer, 0 votes, 50 views, asked 2 months ago
Hi Team, I have an AWS Glue job with a connection to MongoDB Atlas. I am getting this error: `ServerSelectionTimeoutError: xyz.mongodb.net:27017: timed out`. How can I resolve this using AWS PrivateLink and M...
1 answer, 0 votes, 58 views, asked 2 months ago
Steps taken: 1. Select existing ETL Job (let's call it "sample-job"). 2. Clone job. 3. New job created, called "sample-job-copy". 4. Rename job. 5. Hit enter immediately after renaming. Outcome: New...
Accepted Answer | AWS Glue
2 answers, 0 votes, 82 views, asked 2 months ago
We are new to the Glue environment and are dealing with a huge CloudWatch bill. We changed the log level in our PySpark script from INFO to ERROR. We are using both the Python logger and the Spark logger, as below, in PySpark (Glue...
0 answers, 0 votes, 41 views, asked 2 months ago
![My Cost](/media/postImages/original/IMQAIamlJDTlC0OjRCQSZWWg) I'm using the Free Tier. EC2 is using t2.micro. I understand the crawling cost of Glue, but I don't understand why the cost of VPC happ...
1 answer, 0 votes, 58 views, asked 2 months ago
Is GitLab running on an organization's own host (a URL other than https://gitlab.com/) supported by the Git integration in Glue Studio? Thank you.
Accepted Answer | AWS Glue
2 answers, 0 votes, 110 views, asked 2 months ago
Hello. Is there any Glue API that can create a Glue job from the JSON code downloaded from the Glue Studio editor? Or can the downloaded code only be uploaded to the Glue Studio editor manually? Thank...
1 answer, 0 votes, 68 views, asked 2 months ago