All Content tagged with Amazon EMR
Amazon EMR is a cloud big data platform for running large-scale distributed data processing jobs, interactive SQL queries, and machine learning (ML) applications using open-source analytics frameworks such as Apache Spark, Apache Hive, and Presto.
Content language: English
Filter content
Select tags to filter
Sort by
Sort by most recent
464 results

AWS OFFICIALUpdated 2 years ago
Henry Fuentes JrEXPERT
published 2 years ago0 votes713 views
This spotlight on Amazon EMR equips you with the skills and troubleshooting tips to get the most out of a cloud big data platform service.
I have a Trino on EMR setup, I need help on accessing Glue tables from the EMR.
Athena can access those tables.
Below is the error message after running trino cli `show tables;` command
```
dev-dsk...
1
answers
0
votes
83
views
asked 2 years ago
We use AWS EMR 7.2.0 on EC2 with instance fleets (only Primary, Core, no spot instances) and managed scaling for long term use (weeks). On each of the 3 cluster we started so far, we observed the foll...
2
answers
0
votes
465
views
asked 2 years ago
I am using EMR 711395599931.dkr.ecr.us-east-2.amazonaws.com/spark/emr-6.14.0:latest from SparkSubmitOperator and passing this to jar
where I am executing a User define function (UDF) in spark.
I am ge...
2
answers
0
votes
1.2K
views
asked 2 years ago
Hi.
I have set up an EMR Serverless application and i am using my custom image. I've configured everything properly.
Next, I've created a custom image according to the official docs:
https://docs.a...
2
answers
0
votes
485
views
asked 2 years ago

AWS OFFICIALUpdated 2 years ago