Questions tagged with Amazon EMR
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
I have a Python package saved in CodeCommit and need to use it in the notebook attached to my EMR cluster workspace.
The package is already successfully installed via bootstrap.
To do this, in my .sh...
0
answers
0
votes
137
views
asked 9 days agolg...
I have a Serverless EMR appication, I am submitting a spark job via python script. I have packaged all the dependencies an an the script to an s3 bucket. When I execute the job the spark job is...
2
answers
0
votes
162
views
asked 10 days agolg...
Hello,
I configured iceberg formatted table with transaction in hive on EMR 6.4.1. When I insert data into the table, the operation get stuck, without any error.
Any insights are highly...
Accepted AnswerAmazon EMR
1
answers
0
votes
174
views
asked 10 days agolg...
I've started seeing the following error on JupyterHub on EMR
`TypeError: required field "type_ignores" missing from Module`
from the simplest commands
![the...
2
answers
0
votes
164
views
asked 15 days agolg...
Hi Team,
We have EMR 6.10 cluster where flink jobs submitted to existing application. Container was running in task node in my case. Then I resized the task instance group from 1 to 0 in task instance...
Accepted AnswerAmazon EMR
1
answers
0
votes
143
views
asked 25 days agolg...
I need to load data from Kinesis Data Streams to EMR via EMR Studio. I Follow this sample but doesn't work: https://github.com/awslabs/spark-sql-kinesis-connector
1
answers
0
votes
160
views
asked 25 days agolg...
EMR on EKS Anywherelg...
I want to run EMR On-Premises, no Spark.
The question, is possible to run EMR (https://aws.amazon.com/emr/) on EKS Anywhere (https://aws.amazon.com/es/eks/eks-anywhere/)
Also, we don't have support...
1
answers
0
votes
174
views
asked a month agolg...
I am working with Step Function, and I have a MAP type step to which I pass an S3 path in which there is a csv on which the MAP has to iterate. In each loop of the map, a script is executed with the...
2
answers
0
votes
307
views
asked a month agolg...
Inquiry Regarding Spark Application Performance Discrepancy Across AWS Accounts in the Same Regionlg...
**Overview:**
The Spark application in question is deployed within AWS Account A, specifically in the us-west-2 region.
This application reads data from and writes data to Amazon S3 buckets hosted in...
1
answers
0
votes
147
views
asked a month agolg...
EMR API call? Trying to determine if there an API call to determine if "Automatically apply latest Amazon Linux updates" for EMR cluster was checked..
1
answers
0
votes
197
views
asked 2 months agolg...
Hi team
We want to use an EMR Cluster to process data with spark jobs
We have 30,000 files per day and approximately 2Gb of information, later it is planned that this will grow.
We have a small...
1
answers
0
votes
245
views
asked 2 months agolg...
I am using Jupyter notebook within Amazon EMR studio. I try to run my Jupyter notebook code and I get a kernel-related error (see attached screenshot). Also, my EMR instance is using an EC2 cluster. I...
1
answers
0
votes
262
views
asked 2 months agolg...