Questions tagged with AWS Glue
I need to build a POC that implements an ETL process over SAS tables: one Glue job without business rules that reads 7 source tables, then another job that applies business rules to each table, and after the ETL...
1 answer, 0 votes, 73 views, asked 10 hours ago
Hi all,
I'm relatively new to Glue, but I've got a Python ETL script that I've built that works pretty well. It reads two CSV files into dataframes and then unions them together into one normalized...
1 answer, 0 votes, 68 views, asked 3 days ago
I have an AWS Glue connection pointing at an external Kafka Cluster. I have a table declared within AWS Glue pointing at a topic on my Kafka Cluster. It references the AWS Glue connection.
Within...
0 answers, 0 votes, 54 views, asked 3 days ago
Hey everyone, I am trying to query a set of JSON files in S3 with Athena, and I am getting the Hive cursor error for invalid JSON even though the files in question are valid single-line JSON. Is there a...
2 answers, 0 votes, 152 views, asked 4 days ago
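One way to double-check the "valid single-line JSON" assumption in the question above is to confirm that every non-empty line parses as a standalone JSON object. The following is a minimal sketch using only the Python standard library. The function name `find_invalid_json_lines` and the sample text are hypothetical, and the check runs on text that has already been downloaded rather than on the S3 bucket itself.

```python
import json

def find_invalid_json_lines(text):
    """Return (line_number, reason) pairs for lines that are not standalone JSON objects."""
    problems = []
    for number, line in enumerate(text.splitlines(), start=1):
        if not line.strip():
            continue  # blank lines are skipped in this sketch
        try:
            value = json.loads(line)
        except json.JSONDecodeError as exc:
            # The line does not parse on its own, e.g. a record split across lines.
            problems.append((number, exc.msg))
            continue
        if not isinstance(value, dict):
            # Parses, but is an array or scalar rather than one object per line.
            problems.append((number, "not a JSON object"))
    return problems

# Hypothetical sample: line 1 is fine, lines 2-3 are one record split in two,
# line 4 is valid JSON but not an object.
sample = '{"id": 1}\n{"id": 2,\n"name": "x"}\n[1, 2]\n'
for number, reason in find_invalid_json_lines(sample):
    print(number, reason)
```

A file can be valid JSON as a whole and still fail this per-line check, for example when it is pretty-printed or wrapped in a top-level array.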
I've got a fairly simple ETL job that reads several catalog tables or views and does some joins. The job fails with the following error:
```
Error Category: UNCLASSIFIED_ERROR; An error...
```
0 answers, 0 votes, 178 views, asked 6 days ago
I have an AWS Glue workflow that is triggered when a file is dropped into an S3 bucket through an EventBridge rule. Inside this Glue workflow I have set up a Glue trigger to start an ETL job. I have...
1 answer, 0 votes, 89 views, asked 6 days ago
HIVE_CANNOT_OPEN_SPLIT: Error opening Hive split...
1 answer, 0 votes, 263 views, asked 6 days ago
I am crawling data from S3. The data is stored as CSV. This is what the directory looks like:
S3 Bucket
- logs
  - north_america
    - year=2024/
  - europe
    -...
1 answer, 0 votes, 74 views, asked 7 days ago
We need to convert the data coming from Firehose to Parquet format using Glue, with S3 as the final destination.
Access was denied when assuming role. Please ensure that the role...
1 answer, 0 votes, 110 views, asked 7 days ago
Hi
I am experimenting with a task about the "medallion pattern".
I have three folders in one S3 bucket:
raw
silver
gold
and two Glue jobs:
- raw_to_silver which copies a couple of files from raw to...
0 answers, 0 votes, 103 views, asked 7 days ago
I keep getting this error: Error Category: UNCLASSIFIED_ERROR; An error occurred while calling o106.getDynamicFrame. Invalid connection string
But the connection string looks fine; it looks identical to...
1 answer, 0 votes, 176 views, asked 10 days ago
I have CSV files stored in S3. The files are named as follows: {region name}_{today's date}.csv. There are multiple regions. These files are saved under the 'log/year/month/date' directory. So this...
1 answer, 0 votes, 235 views, asked 11 days ago
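The naming scheme described in the last question can be made concrete with a small sketch. It assumes ISO-formatted dates in the file name and zero-padded year/month/day path components, neither of which the excerpt specifies. `build_log_key` and the region value are hypothetical names used only for illustration.

```python
from datetime import date

def build_log_key(region, day):
    """Build an object key of the form log/year/month/date/{region}_{date}.csv."""
    # Assumption: path components are zero-padded numbers.
    prefix = f"log/{day.year:04d}/{day.month:02d}/{day.day:02d}"
    # Assumption: the date in the file name is ISO formatted (YYYY-MM-DD).
    filename = f"{region}_{day.isoformat()}.csv"
    return f"{prefix}/{filename}"

print(build_log_key("europe", date(2024, 5, 17)))
# prints: log/2024/05/17/europe_2024-05-17.csv
```

With this layout the region appears only in the file name, not in the path, so every region's file for a given day ends up under the same prefix.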