Skip to content

All Content tagged with Amazon EMR

Amazon EMR is a cloud big data platform for running large-scale distributed data processing jobs, interactive SQL queries, and machine learning (ML) applications using open-source analytics frameworks such as Apache Spark, Apache Hive, and Presto.

Content language: English

Filter content
Select tags to filter
Sort by
Sort by most recent
460 results
I'm trying to install Python modules in my AWS Glue Python Shell job using wheel files stored in Amazon Simple Storage Service (Amazon S3). My job runs in a private Virtual Private Cloud (VPC) with Am...
This framework provides a structured approach for migrating analytics workloads from EMR on EC2 to EMR Serverless in enterprise environments. It guides organizations through the complete migration lif...
Enterprises struggle with EMR version upgrades, facing challenges like production downtime, performance degradation, and compliance risks. Without a structured approach, organizations often experience...
Hello, I am running an Apache Spark job on Amazon EMR that needs to connect to an Amazon MSK cluster configured with IAM authentication. The EMR cluster has an IAM role with full MSK permissions, and...
1
answers
0
votes
218
views
asked 5 months ago
Hi Team, I'm trying to set up the Amazon Q on the EMR studio notebook workspace and followed this guide: https://docs.aws.amazon.com/amazonq/latest/qdeveloper-ug/emr-setup.html?trk=769a1a2b-8c19-4976...
1
answers
0
votes
105
views
asked 5 months ago
Getting this issue in Amazon EMR during a pyspark job execution. ``` df = spark.read.parquet("s3a://test/raw-billing-cor-data/cur2/123456789/cid-cur2/data/BILLING_PERIOD=2025-08/") py4j.protocol.Py4...
1
answers
0
votes
140
views
asked 5 months ago
Hi. I am trying to configure ZGC in HBase following the recommendations, but the JAVA_HOME and HBASE_REGIONSERVER_GC_OPTS variables are not modified in the /etc/hbase/conf/hbase-env.sh file. Has anyo...
1
answers
0
votes
68
views
asked 5 months ago
I am deploying an EMR HBase cluster with EMR WAL enabled using Terraform. The cluster is created successfully and the WALs are visible using the emrwal CLI. When I change some configuration of my clus...
1
answers
0
votes
97
views
asked 6 months ago
Hi everyone, I currently have an EMR cluster (emr-6.9.0) running a real-time ingestion process. To save disk space, I’ve been using the **Cloud Shuffle Storage Plugin** for Apache Spark. Now, I need ...
1
answers
0
votes
210
views
asked 6 months ago
Hi , Recently i started facing issues with EMR (EC2 is out of capacity), mentioning that "EC2 is out of capacity for m6a.12xlarge in availability zone us-east-1c" I tried different machines in same ...
Accepted AnswerAmazon EC2Amazon EMR
1
answers
0
votes
153
views
asked 8 months ago
  • 1
  • 2
  • 3
  • 4
  • 5
  • •••
  • 39
  • Page size
    12 / page