All Content tagged with AWS Glue

AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.

Content language: English

Select tags to filter
Sort by most recent
2009 results
Explore how to access and analyze Arbitrum blockchain data using AWS Glue and Amazon Athena. Learn to run scalable queries on real-world data for DeFi, research, and more.
Hi Everyone I have multiple xml files and I'm leveraging glue job to parse those xml and convert into table that I can use it later on. I tried creating grok classifier and xml classifier but someho...
1
answers
0
votes
50
views
AWS
asked a month ago
Hi team, we are trying to upgrade from Glue 5.0 from Glue 4.0. However, we are getting errors and the message is that there is something wrong with our JDBC string. The current JDBC string we use is...
1
answers
0
votes
98
views
asked a month ago
I've created a Glue ETL job with a Redshift table as the source node and a dropDuplicates transform over some specific keys (not the entire row, because they have created timestamps). I understand the...
1
answers
0
votes
34
views
profile picture
asked a month ago
We have an AWS Glue pipeline where: A crawler populates a raw database table from partitioned JSON files in S3. S3 structure: ``` raw/ ├── org=21/ │ └── 221.json └── org=23/ └── 654.js...
2
answers
0
votes
98
views
asked a month ago
Let's say we have an s3 table with a specific set of fields defined for its schema, sometime later we send new data with more fields that do not exist in schema definition yet, now we update S3 table ...
2
answers
0
votes
111
views
asked a month ago
I have a large DDB table with 7 TB of data, 25 billion rows. This is a production table. I need to scan and add/update a column to each row of the table. The table has daily export to s3. I am conside...
2
answers
0
votes
67
views
asked a month ago
I am trying to set the identifier-field-ids to the Iceberg tables so that Firehose can perform update/delete operations on Iceberg tables as cannot add unique keys on dynamic database. I am creating I...
1
answers
0
votes
177
views
AWS
asked a month ago
Explore the power of XRP Ledger data analysis using AWS Public Blockchain Datasets. Learn how to access, query, and gain insights from XRP data using AWS Glue and Amazon Athena.
I have created sagemaker unified studio using manual setup method, I have created project within the domain having project profile of "All Capabilities" also I have created and added new compute of em...
2
answers
0
votes
78
views
asked a month ago
Hi, I'm attempting to create a Ray glue interactive session, Based on the [announcement blog post](https://aws.amazon.com/blogs/big-data/introducing-aws-glue-for-ray-scaling-your-data-integration-wor...
1
answers
0
votes
50
views
asked a month ago
I have created a compute connection for existing redshift serverless in sagemaker unified studio also added neccesary tag in workgroup as mentioned in the document. ![Enter image description here](/...
1
answers
0
votes
55
views
asked a month ago