All Content tagged with AWS Glue

AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.

I want to create a crawler on my RDS database, but I cannot create the role needed as it is disabled. The AWS console user I am using has an admin-level role...
1 answer | 0 votes | 143 views | Remiby asked a month ago
I have written an ETL job in AWS Glue using the interactive notebook and I want to enable job bookmark to avoid reprocessing already processed data. The source data are in an S3 bucket, a Glue data...
2 answers | 0 votes | 181 views | Vas asked a month ago
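For context on the question above: Glue job bookmarks are switched on through the `--job-bookmark-option` job parameter, and the script also needs to call `job.init()` and `job.commit()` and pass a `transformation_ctx` on its sources for bookmark state to be tracked. The snippet below is only a minimal sketch of building the job arguments; the helper name `with_job_bookmark` is hypothetical and not part of any AWS library, and the resulting dictionary is assumed to be passed as the job's default arguments when the job is created or updated.

```python
from typing import Dict, Optional

# Values accepted by the Glue `--job-bookmark-option` job parameter.
VALID_BOOKMARK_OPTIONS = {
    "job-bookmark-enable",
    "job-bookmark-disable",
    "job-bookmark-pause",
}


def with_job_bookmark(
    default_arguments: Optional[Dict[str, str]] = None,
    option: str = "job-bookmark-enable",
) -> Dict[str, str]:
    """Return a copy of the job arguments with the bookmark option set.

    Hypothetical helper for illustration; it only builds a dictionary
    and does not call any AWS API.
    """
    if option not in VALID_BOOKMARK_OPTIONS:
        raise ValueError(f"unsupported bookmark option: {option}")
    merged = dict(default_arguments or {})
    merged["--job-bookmark-option"] = option
    return merged


if __name__ == "__main__":
    # Example: keep an existing argument and enable bookmarks.
    print(with_job_bookmark({"--enable-metrics": "true"}))
```

Setting the parameter alone is usually not enough: if the script never calls `job.commit()`, the bookmark state is not saved and the next run reprocesses the same data.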
Hi, I am using an S3 bucket for data shuffling. The Glue job failed with the following error: "An error occurred while calling o147.saveAsTable. Job aborted due to stage failure: ResultStage 5...
2 answers | 0 votes | 164 views | mykc asked a month ago
Hello, I want to add a file pattern to an AWS Glue ETL job Python script so that it generates files in the S3 bucket matching the pattern dostrp*.csv.gz, but I could not find a way to provide this file...
Accepted Answer | AWS Glue
1 answer | 0 votes | 131 views | RahulD asked a month ago
I am developing a data pipeline for building a Redshift data warehouse as below: 1. Export DynamoDB data to S3 using the 'export to S3' feature 2. In Glue, create a Spark DataFrame on the S3 exported...
0 answers | 0 votes | 158 views | asked a month ago
I'm trying to build a regular expression for a Grok pattern. The goal is: given a sequence of digits, I need to assign the first eleven digits to a named capturing group, and then digits from...
0 answers | 0 votes | 145 views | asked a month ago
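As a rough illustration of the first half of that goal, the sketch below captures the first eleven digits of a digit sequence into a named group. The question is truncated, so what should happen to the later digits is unknown; here they are simply collected into a second group as an assumption. The group names `first_eleven` and `remaining` are hypothetical. Python writes named groups as `(?P<name>...)`, whereas Grok custom patterns typically use the `(?<name>...)` form, so the expression would need that small adjustment before being used in a classifier.

```python
import re
from typing import Optional, Tuple

# First eleven digits go into one named group. Collecting the rest into
# "remaining" is an assumption, since the original question is cut off.
DIGITS_PATTERN = re.compile(r"(?P<first_eleven>[0-9]{11})(?P<remaining>[0-9]*)")


def split_digits(value: str) -> Optional[Tuple[str, str]]:
    """Split a digit string into its first eleven digits and the rest.

    Returns None when the input is not made up entirely of digits or
    has fewer than eleven of them.
    """
    match = DIGITS_PATTERN.fullmatch(value)
    if match is None:
        return None
    return match.group("first_eleven"), match.group("remaining")


if __name__ == "__main__":
    print(split_digits("1234567890123"))
```

The equivalent Grok-style expression under the same assumptions would be `(?<first_eleven>[0-9]{11})(?<remaining>[0-9]*)`.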
Hi, I am using S3 for data shuffling in my Glue job. When I ran the notebook, it failed with a FileNotFoundException for some objects. However, I can see the objects in the shuffle S3 bucket. Any...
2 answers | 0 votes | 158 views | mykc asked a month ago
**Overview of Your AWS Architecture:**
- **Data Source:** RDS Aurora MySQL
- **ETL Process:** AWS Glue
- **Data Target:** Amazon Redshift
- **Reporting Tool:** Amazon QuickSight

**My Objective:** i...
1 answer | 0 votes | 173 views | Mouhcin asked a month ago
Hi, I would appreciate any help on how I can connect or test the connection to Oracle on-prem either from Glue or other services. I followed the steps from this article...
Accepted Answer | AWS Glue
1 answer | 0 votes | 153 views | JSM asked a month ago
Hi, I need help connecting to the Oracle on-prem database via JDBC in AWS Glue. I am using a custom driver. Our Oracle version is 19c and I'm using ojdbc10. I've been getting different errors and this...
0 answers | 0 votes | 122 views | JSM asked a month ago
Hello team, I have a use case to read data from an on-premises table to S3 using MWAA. Once the read completes, I need to update a flag in the on-premises table (a typical Oracle update of records from Glue...
0 answers | 0 votes | 319 views | asked a month ago
I am blocked trying to use the OpenSearch Glue connection to connect to an OpenSearch domain that is hosted in a VPC. I have configured the connection to OpenSearch with a security group that allows...
2 answers | 0 votes | 248 views | asked a month ago