By using AWS re:Post, you agree to the AWS re:Post Terms of Use

All Content tagged with Amazon EMR Studio

Content language: English

Select up to 5 tags to filter
Sort by most recent
20 results
I am trying to load data from an S3 bucket to a Data Frame in EMR Studio. "df = spark.read.csv("s3://HIDDEN-sandbox/HIDDEN/avod_title_content_metrics_w/", header = True)" When I run "df.show(5)" I a...
1
answers
0
votes
25
views
asked 16 days ago
profile pictureAWS
published a month ago0 votes137 views
This spotlight on Amazon EMR equips you with the skills and troubleshooting tips to get the most out of a cloud big data platform service.
Hi, creating workspaces in EMR Studio and I get this error, Open failed Workspace(notebook) is stopped. Service Role does not have the required permissions. I dont see anywhere under workspace or app...
1
answers
0
votes
70
views
asked 3 months ago
* Setup like this is done: https://aws.amazon.com/blogs/big-data/bring-your-workforce-identity-to-amazon-emr-studio-and-athena/ * S3 Access point created * Bucket Policy to allow access via AP given *...
1
answers
0
votes
86
views
asked 3 months ago
I have been testing out using Livy endpoint from a EMR serverless application according to this documentation: https://docs.aws.amazon.com/emr/latest/EMR-Serverless-UserGuide/interactive-workloads-liv...
1
answers
0
votes
147
views
asked 4 months ago
When I query some delta lake tables, it gives ""internal server error"" and colum extract on left side panel, simply shows loading. Is it a bug or something missed at my end?
Accepted AnswerAmazon EMR Studio
1
answers
0
votes
129
views
asked 5 months ago
Here is the error I get in EMR Studio Workspace. "To link or unlink Git repositories, you must configure VPC network connections in the Edit Studio page" I have tried to create just a studio, config...
2
answers
0
votes
214
views
asked 7 months ago
I have an EMR workspace under which I have 4 Jupyter notebooks created on which PySpark code blocks are run. I want to get the last execution code block time across all 4 notebooks to determine the ti...
1
answers
0
votes
610
views
asked 7 months ago
I am running an EMR cluster with an attached notebook, and using Apache spark to load/process data however I have not been able to load data into Apache. Whenever I try to run spark.read.csv('s3://buc...
2
answers
0
votes
910
views
asked 7 months ago
I have a Python package saved in CodeCommit and need to use it in the notebook attached to my EMR cluster workspace. The package is already successfully installed via bootstrap. To do this, in my .sh ...
1
answers
0
votes
584
views
asked 8 months ago
I need to load data from Kinesis Data Streams to EMR via EMR Studio. I Follow this sample but doesn't work: https://github.com/awslabs/spark-sql-kinesis-connector
1
answers
0
votes
1.6K
views
AWS
asked 9 months ago
hello i'am using EMR Notebook attached to EMR Serverless Application. i'm wondering how can i get the pricing information? there's nowhere i can find about the price how much i used. when i execute th...
2
answers
0
votes
410
views
asked 10 months ago
  • 1
  • 2
  • 12 / page