Articles tagged with Amazon EMR
Amazon EMR is a cloud big data platform for running large-scale distributed data processing jobs, interactive SQL queries, and machine learning (ML) applications using open-source analytics frameworks such as Apache Spark, Apache Hive, and Presto.
12 results
EXPERT
published 19 days ago, 2 votes, 72 views
Performance testing of big data analytics tools and engines at petabyte scale is increasingly challenging. Traditional sample test datasets may not reflect the actual production-grade...
SUPPORT ENGINEER
published 5 months ago, 3 votes, 1.3K views
This article explains how to set up and access Delta tables from SQL Explorer in EMR JupyterHub. SQL Explorer uses the Presto engine configured within the EMR cluster to process data...
SUPPORT ENGINEER
published 5 months ago, 2 votes, 2.2K views
This article explains how to configure additional Amazon Elastic Block Store (Amazon EBS) volumes for HDFS or YARN to increase the storage capacity of a running Amazon EMR cluster.
SUPPORT ENGINEER
published 7 months ago, 2 votes, 1.6K views
This article provides guidance on configuring and accessing the Spark application UI for interactive endpoints, whether used with self-hosted notebooks or EMR Studio managed notebooks.
SUPPORT ENGINEER
published 8 months ago, 3 votes, 1.3K views
The guidance in this article can help you evaluate the log data comprehensively and systematically, potentially leading to the identification and resolution of the u...
SUPPORT ENGINEER
published 8 months ago, 3 votes, 1.6K views
This article helps you investigate an EMR cluster that terminated with the error "On the master instance, application provisioning failed".
SUPPORT ENGINEER
published 8 months ago, 3 votes, 1.1K views
This article helps you investigate an EMR cluster that terminated with the error "Master instance startup failed due to an internal error", especially when using a custom AMI.
SUPPORT ENGINEER
published 8 months ago, 3 votes, 1.5K views
This article helps you investigate an EMR cluster that terminated with the error "Failed to start the job flow due to an internal error", especially when using a custom AMI.
SUPPORT ENGINEER
published 8 months ago, 3 votes, 1.2K views
The instance-state log available in Amazon EMR on EC2 provides valuable information for troubleshooting application failures or investigating system details. This article describes the detailed i...
EXPERT
published 8 months ago, 0 votes, 1.7K views
Assistance with building and installing prerequisite software for TensorFlow on Amazon Linux 2023 for Graviton.
SUPPORT ENGINEER
published 9 months ago, 3 votes, 1.4K views
This article describes the high-level procedure for integrating Tableau with a Kerberized EMR cluster.
EXPERT
published 9 months ago, 1 vote, 1.5K views
Assistance with building and installing prerequisite software for GeoPandas on Amazon Linux 2023 for Graviton.