Unanswered Questions tagged with Amazon EMR
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
Issue: PySpark works in the first cells (likely SparkSession creation) but throws import errors when using my Python files in later cells.
Environment: AWS EMR ( Amazon EMR...
0
answers
0
votes
316
views
asked a month agolg...
I have a Python package saved in CodeCommit and need to use it in the notebook attached to my EMR cluster workspace.
The package is already successfully installed via bootstrap.
To do this, in my .sh...
0
answers
0
votes
456
views
asked 2 months agolg...
I am trying to use aws emr-serverless get-dashboard-for-job-run cli command to pull information from emr-serverless but am stumped. This command returns a url and auth token. If I go to the url, it...
0
answers
0
votes
127
views
asked 4 months agolg...
When I try to create a new workspace for an AWS EMR Studio in the AWS Console, I get a blank page and a Javascript error in the console
("Failed to execute 'mark' on 'Performance':...
0
answers
0
votes
108
views
asked 5 months agolg...
I am facing issues using inline clustering and compaction in EMR, with the following error..
EMR : 6.13.0
Hudi: 0.13.1
com.esotericsoftware.kryo.KryoException: Unable to find class:...
0
answers
0
votes
96
views
asked 6 months agolg...
We are running an application based on EMR 5.36.0, and our security scans note several "high" impact Tomcat vulnerability (used internally by EMR). The last 5.x release was July 2022, and Tomcat was...
0
answers
0
votes
47
views
asked a year agolg...
This [documentation page][1] shows how to set the `JAVA_HOME` environment variable for EMR. I'm experimenting with running another version of Java, and I want to try passing some more command line...
0
answers
0
votes
115
views
asked a year agolg...
Hi team,
I see that deleting the EMR cluster sometimes does not necessarily delete the virtual clusters that they create and they remain running stale.
Everytime I see this issue, I can only use the...
0
answers
0
votes
53
views
asked a year agolg...
Hi there,
I need to create a custom kernel with my own libraries for use in EMR, so Im trying to follow this EMR...
0
answers
0
votes
85
views
asked a year agolg...
[https://docs.aws.amazon.com/emr/latest/EMR-Serverless-UserGuide/using-ddb-connector.html#using-ddb-connector-query](This doc page) describes connecting to dynamodb from spark, however it is...
0
answers
0
votes
96
views
asked a year agolg...
HIVE_UNSUPPORTED_FORMAT: Output format org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat with SerDe org.openx.data.jsonserde.JsonSerDe is not supported. If a data manifest file was generated...
0
answers
0
votes
182
views
asked a year agolg...
When selecting a pyspark kernel for a notebook in EMR studio, tab completion and tooltips (with shift-Tab) are not working as expected. This is especially true for for attribute listing after a dot...
0
answers
0
votes
80
views
asked 2 years agolg...