All Content tagged with Amazon EMR
Amazon EMR is a cloud big data platform for running large-scale distributed data processing jobs, interactive SQL queries, and machine learning (ML) applications using open-source analytics frameworks such as Apache Spark, Apache Hive, and Presto.
438 results
Here's a link to my sample calculation: https://calculator.aws/#/estimate?id=e1754f12531b5a51f332143cb5e5a53e4a626f34
I read in another answer that short Serverless workloads are cheaper in general t...
Hi Mate,
I have steps running on EMR that were working until 13 January 2025. When I ran the job again today, it started failing with an error like: **AttributeError: module 'awscrt.chec...
Hello Team,
We followed https://github.com/aws-samples/aws-emr-utilities/blob/main/utilities/emr-ec2-custom-python3/README.md#2-container-images-on-yarn
We ran into issues when following it to deploy with d...
Hello,
After switching to a custom AMI on our EMR on EC2 cluster, spark-submit fails:
```
confs: [default]
0 artifacts copied, 60 already retrieved (0kB/30ms)
25/01/23 13:11:37 WAR...
```
Hello,
I have an EMR cluster running an old release version, and our security tool flagged some security issues, so I want to update the software on the cluster.
What is the best way to do this?
Thanks
Hello everyone!
I could not find documentation that explicitly mentions a timeout for an EMR cluster. Does EMR execution itself have a timeout?
I would like to know how long the step is allow...
I'm working with AWS EMR Serverless, and I need to construct a job URL for an EMR Serverless job to be sent in a message notification in case of state change. The desired URL includes the associated E...
My PySpark job is failing in a mapPartitions function with a "JavaPackage object is not callable" error. I have verified that the function I am passing to mapPartitions is callable and objects pass...
Aim: Create an EMR cluster and attach it to a workspace, to use with JupyterLab.
EMR cluster created with default options: see end of this post for full description.
Creating the studio:
`aws emr crea...
When I try to access the Oozie UI on the stated release label, the following error appears even though Oozie is installed and running. How can I resolve this?
```
HTTP Status 500 - java.lang.NoSuchMethodE...
```
I am launching a cluster using a JSON script, and the launch succeeds. However, when I attempt to add an 'AutoScalingPolicy' as part of the same JSON file in the Step Function...
Since upgrading from EMR 6.X to EMR 7.X, the Spark History Server produces three distinct HTTP 500 errors, the first being a basic JSON exception:
com.fasterxml.jackson.core.io.JsonEOFException: Unexpected...