Import and Read csv using pyspark in Sagemaker Studio - JAVA_HOME is not set PySparkRuntimeError

0

I am getting java runtime error when I try to import and Read csv in S3 using pyspark in Sagemaker Studio. Can anyone help ?

https://github.com/rog-SARTHAK/AWS-Sagemaker-Studio/blob/main/Crime.ipynb

Sarthak
asked 6 months ago291 views
1 Answer
0

Hi Sarthak,

Kindly use a lifecycle configuration script as detailed at - https://aws.amazon.com/blogs/machine-learning/customize-amazon-sagemaker-studio-using-lifecycle-configurations/

Sample script below for reference -

#!/bin/bash

set -e

sudo yum -y update sudo yum -y install java-1.8.0-openjdk-devel

Kindly know if this helps.

AWS
answered 6 months ago
profile picture
EXPERT
reviewed 22 days ago

You are not logged in. Log in to post an answer.

A good answer clearly answers the question and provides constructive feedback and encourages professional growth in the question asker.

Guidelines for Answering Questions