All Content tagged with AWS Glue
AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.
Content language: English
Select up to 5 tags to filter
Sort by most recent
Hi, I need help connecting to the Oracle on-prem database via JDBC in AWS Glue. I am using a custom driver. Our Oracle version is 19c and I'm using ojdbc10. I've been getting different errors and this...
Hello team,
I have a use case to read data from on-premise table to s3 using mwaa . Once the read completes i need to update a flag in the on-premise table { a typical oracle update records from glue...
I am blocked trying to use the OpenSearch Glue connection to connect to an OpenSearch domain that is hosted on a VPC. I have configured the connection to OpenSearch with a security group that allows...
Currently, I am using the spark hudi deltastreamer job to read CSV and write to the hudi table. Additionally, sync the s3 bucket with the AWS glue catalog. Deltastreamer job creates 2 tables...
Hello,
I am experiencing an issue with configuring AWS Glue to use Bitbucket for version control. Despite providing the necessary permissions and configuring the repository owner, I am unable to list...
Hello,
I am facing this weird issue from AWS Glue. I do have a NAT Gateway in the VPC, which should take care of the network issues. So I am not sure why the networking issue exists/persists. I...
I'm running a Visual ETL job under Glue service. I'm testing that service thru visual editor and I stated thru datasource pointing to some DynamoDB table (before I made a crawler, run it then I aws...
I have an EventBridge rule that triggers when a new file is added to an S3 bucket with the EventBridge target being a glue workflow. Now I want to pass event data from EventBridge to my glue workflow...
I zipped my modules into zip file, uploaded to s3 and added to Pyspark and Shell jobs under `Python library path
` parameter:
![Enter image description...
Hi, I have been using a docker image from amazon/aws-glue-libs:glue_libs_4.0.0_image_01 to run locally Glue Spark jobs.
I also want to test Ray locally in the same manner, by inspecting the image...
We are using Step Functions for our ETL pipeline. The first step kicks off 21 jobs that each take about 1-3 minutes each consuming 2 DPUs. The Step Function fails with the below error when trying to...