Questions tagged with AWS Glue
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
Hi all,
I'm relatively new to Glue, but I've got a Python ETL script that I've built that works pretty well. It reads two CSV files into dataframes and then unions them together into one normalized...
0
answers
0
votes
47
views
asked a day agolg...
I have an AWS Glue connection pointing at an external Kafka Cluster. I have a table declared within AWS Glue pointing at a topic on my Kafka Cluster. It references the AWS Glue connection.
Within...
0
answers
0
votes
43
views
asked a day agolg...
Hey everyone I am trying to query a set of JSON files in S3 with Athena and I am getting the Hive cursor error for invalid JSON even though the files in question are valid single-line JSON. Is there a...
2
answers
0
votes
136
views
asked 2 days agolg...
I've got a fairly simple ETL job that reads several catalog tables or views and does some joins. the job errors out with the following error:
```
Error Category: UNCLASSIFIED_ERROR; An error...
0
answers
0
votes
166
views
asked 4 days agolg...
I have a AWS Glue workflow which is triggered when a file gets dropped into a s3 bucket thru evenbridge rule . Inside this Glue workflow I have setup a Glue trigger to trigger a ETL job. I have...
1
answers
0
votes
78
views
asked 5 days agolg...
HIVE_CANNOT_OPEN_SPLIT: Error opening Hive split...
1
answers
0
votes
250
views
asked 5 days agolg...
I am crawling data from S3. The data are stored in CSV form. This is how the directory looks like:
S3 Bucket
- logs
- north_america
- year=2024/
- europe
-...
1
answers
0
votes
62
views
asked 5 days agolg...
We would need to transfer the data from the firehose to parquet format using Glue and the final destination is to store in S3.
Access was denied when assuming role. Please ensure that the role...
1
answers
0
votes
94
views
asked 5 days agolg...
Hi
I am experimenting with a task about the "medaillon pattern".
I have three folder in one S3 bucket:
raw
silver
gold
and two Glue jobs:
- raw_to_silver which copies a couple of files from raw to...
0
answers
0
votes
92
views
asked 6 days agolg...
I keep getting this error, Error Category: UNCLASSIFIED_ERROR; An error occurred while calling o106.getDynamicFrame. Invalid connection string
But the connection string looks fine, looks identical to...
1
answers
0
votes
164
views
asked 9 days agolg...
I have csv files stored in S3. The files are named as followed: {region name}_{today's date}.csv. There are multiple regions. These files are saved under 'log/year/month/date' directory. So this...
1
answers
0
votes
222
views
asked 9 days agolg...
I have Excel sheets with multiple sheets on it stored in S3. Currently, I have separate csv files for each sheet, and crawling from each csv files. Instead of doing this, I would like to crawl from...
1
answers
0
votes
216
views
asked 9 days agolg...