All Content tagged with Amazon EMR Studio

22 results
Hello Team, we followed https://github.com/aws-samples/aws-emr-utilities/blob/main/utilities/emr-ec2-custom-python3/README.md#2-container-images-on-yarn and are getting issues when we followed to deploy with d...
2
answers
0
votes
17
views
asked a day ago
I'm working with AWS EMR Serverless, and I need to construct a job URL for an EMR Serverless job to be sent in a message notification in case of state change. The desired URL includes the associated E...
1
answers
0
votes
30
views
asked 13 days ago
I am trying to load data from an S3 bucket to a Data Frame in EMR Studio. "df = spark.read.csv("s3://HIDDEN-sandbox/HIDDEN/avod_title_content_metrics_w/", header = True)" When I run "df.show(5)" I a...
1
answers
0
votes
83
views
asked 2 months ago
AWS
published 3 months ago, 0 votes, 294 views
This spotlight on Amazon EMR equips you with the skills and troubleshooting tips to get the most out of a cloud big data platform service.
Hi, I am creating workspaces in EMR Studio and I get this error: "Open failed Workspace(notebook) is stopped. Service Role does not have the required permissions." I don't see anywhere under workspace or app...
1
answers
0
votes
91
views
asked 5 months ago
* Setup like this is done: https://aws.amazon.com/blogs/big-data/bring-your-workforce-identity-to-amazon-emr-studio-and-athena/
* S3 Access point created
* Bucket Policy to allow access via AP given
*...
1
answers
0
votes
128
views
asked 5 months ago
I have been testing the Livy endpoint from an EMR Serverless application according to this documentation: https://docs.aws.amazon.com/emr/latest/EMR-Serverless-UserGuide/interactive-workloads-liv...
1
answers
0
votes
231
views
asked 6 months ago
When I query some Delta Lake tables, it gives an "internal server error", and the column extract in the left side panel simply shows loading. Is it a bug, or did I miss something on my end?
Accepted Answer
Amazon EMR Studio
1
answers
0
votes
159
views
asked 6 months ago
Here is the error I get in EMR Studio Workspace: "To link or unlink Git repositories, you must configure VPC network connections in the Edit Studio page". I have tried to create just a studio, config...
2
answers
0
votes
248
views
asked 8 months ago
I have an EMR workspace containing 4 Jupyter notebooks that run PySpark code blocks. I want to get the time of the last executed code block across all 4 notebooks to determine the ti...
1
answers
0
votes
628
views
asked 9 months ago
I am running an EMR cluster with an attached notebook and using Apache Spark to load and process data; however, I have not been able to load data into Spark. Whenever I try to run spark.read.csv('s3://buc...
2
answers
0
votes
939
views
asked 9 months ago
I have a Python package saved in CodeCommit and need to use it in the notebook attached to my EMR cluster workspace. The package is already successfully installed via bootstrap. To do this, in my .sh ...
1
answers
0
votes
604
views
asked 10 months ago