Questions tagged with Amazon EMR
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
The Zero ETL Integration for replicating data to Redshift from Aurora PostgreSQL is currently in "Preview", as [this post specifies...
1
answers
0
votes
202
views
asked 3 months agolg...
Is there a way to use s3-dist-cp to copy files from a bucket that uses Requestor payments?
2
answers
0
votes
268
views
asked 3 months agolg...
EMRFS write errorslg...
Upgrading from EMR versions 6.11 to 6.12 (even tried 7.0.0), I'm seeing these errors on the same exact job with the same resources - has something changed with how EMRFS has been implemented? What is...
1
answers
0
votes
497
views
asked 3 months agolg...
Good morning,
As recently, a vulnerability on Resource Manager has been exploited, we are worried and want to confirm with you about the impact....
2
answers
0
votes
209
views
asked 3 months agolg...
I am trying to install happybase package on Zeppelin notebook ( or for that matter any package ) . How do I do a pip install from a Zeppelin cell .
%pip or !pip is not recognized
2
answers
0
votes
173
views
asked 3 months agolg...
Is there a way to check the integrity of files copied with S3DistCp at the end of the copy, like DistCp checksum?
1
answers
0
votes
215
views
asked 4 months agolg...
EMR had 1 primary, 1 core and 5 task nodes. All 3 group of nodes were on demand (including task group). I didn't use spot purchasing for task group to avoid unexpected termination. But still EMR...
1
answers
0
votes
342
views
asked 4 months agolg...
In AWS EMR, I encountered the following error message when running a pyspark job, which ran successfully on my local machine.
> [System Error] Fail to delete the temp folder
Is there a way to...
Accepted AnswerAmazon EMR
1
answers
0
votes
202
views
asked 4 months agolg...
When using EMR 7.0.0 in EMR Serverless (have not tried EKS or EC2), after connecting to the application through a EMR Studio workspace, the pyspark kernel doesn't work in a notebook. It stays in...
1
answers
0
votes
274
views
asked 4 months agolg...
Hi,
We have an EMR cluster with multiple concurrent steps gets executed seamlessly. Not sure what happened certainly, but the step logs, application logs are not published to s3 from yesterday....
Accepted AnswerAmazon EMR
3
answers
0
votes
292
views
asked 4 months agolg...
I am trying to use aws emr-serverless get-dashboard-for-job-run cli command to pull information from emr-serverless but am stumped. This command returns a url and auth token. If I go to the url, it...
0
answers
0
votes
116
views
asked 4 months agolg...
Hi,
after EMR 7.0.0 was released in the previous week, we wanted to start using it.
# Problem
We have shell script EMR steps that are executed during the start of the cluster. These EMR steps never...
2
answers
0
votes
308
views
asked 4 months agolg...