All Content tagged with Amazon EMR Studio
22 results
Hello Team,
Followed https://github.com/aws-samples/aws-emr-utilities/blob/main/utilities/emr-ec2-custom-python3/README.md#2-container-images-on-yarn
Getting issues when we followed to deploy with d...
I'm working with AWS EMR Serverless, and I need to construct a job URL for an EMR Serverless job to be sent in a message notification in case of state change. The desired URL includes the associated E...
I am trying to load data from an S3 bucket into a DataFrame in EMR Studio.
"df = spark.read.csv("s3://HIDDEN-sandbox/HIDDEN/avod_title_content_metrics_w/", header = True)"
When I run "df.show(5)" I a...
EXPERT
published 3 months ago, 0 votes, 294 views
This spotlight on Amazon EMR equips you with the skills and troubleshooting tips to get the most out of a cloud big data platform service.
Hi,
I am creating workspaces in EMR Studio and I get this error: "Open failed Workspace(notebook) is stopped. Service Role does not have the required permissions." I don't see anywhere under workspace or app...
* Setup was done as described here: https://aws.amazon.com/blogs/big-data/bring-your-workforce-identity-to-amazon-emr-studio-and-athena/
* S3 Access Point created
* Bucket policy added to allow access via the Access Point
*...
I have been testing the Livy endpoint of an EMR Serverless application according to this documentation: https://docs.aws.amazon.com/emr/latest/EMR-Serverless-UserGuide/interactive-workloads-liv...
When I query some Delta Lake tables, it gives
"internal server error"
and the column extract in the left-side panel simply shows loading. Is this a bug, or did I miss something on my end?
Here is the error I get in EMR Studio Workspace.
"To link or unlink Git repositories, you must configure VPC network connections in the Edit Studio page"
I have tried to create just a studio, config...
I have an EMR workspace containing 4 Jupyter notebooks in which PySpark code blocks are run.
I want to get the time of the last executed code block across all 4 notebooks to determine the ti...
I am running an EMR cluster with an attached notebook and using Apache Spark to load and process data; however, I have not been able to load data into Spark. Whenever I try to run spark.read.csv('s3://buc...
I have a Python package saved in CodeCommit and need to use it in the notebook attached to my EMR cluster workspace.
The package is already successfully installed via bootstrap.
To do this, in my .sh ...