Unable to connect redshift from emr

1

Hi, I am getting yarn does not exist error when Connecting redshift cluster from emr where both are in same vpc. Is there any connectivity missing or any step that I need to make sure ?

Mark
질문됨 7달 전234회 조회
1개 답변
3
수락된 답변

Hello,

"FATAL: user "IAM:yarn" does not exist"

Here EMR is considering 'yarn' as a User which in turn trying to authenticate as DBUser using GetClusterCredentials API and add as a suffix for "IAM: ". It might be the case that if the user might not be exist or not provided in your URL. As you get user doest exist, your connectivity seems to be working fine. However, just in case you have not configured already then, You need to update the security group attached to the Redshift cluster to allow incoming traffic from EMR cluster by adding below inbound rules for EMR Master and EMR Slave Security Group and test the connectivity,

telnet redshift-cluster-1.xxxxxxxxx.xxxxxxxx.redshift.amazonaws.com 5439

Then try the following connection/compare with your connection URL,

df1 = spark.read.format("jdbc").option("url", "jdbc:redshift://<redshift endpoint:5439/<Database_Name>?user=<Admin_Username>&password=<Admin_Password>").option("dbtable", "public.category").load()
df1.printSchema()
df1.show()

If a user named DbUser exists in the database, the temporary user credentials have the same permissions as the existing user. If DbUser doesn't exist in the database and AutoCreate is true, a new user named DbUser is created. -

https://docs.aws.amazon.com/redshift/latest/APIReference/API_GetClusterCredentials.html

AWS
지원 엔지니어
답변함 7달 전
profile pictureAWS
전문가
검토됨 7달 전

로그인하지 않았습니다. 로그인해야 답변을 게시할 수 있습니다.

좋은 답변은 질문에 명확하게 답하고 건설적인 피드백을 제공하며 질문자의 전문적인 성장을 장려합니다.

질문 답변하기에 대한 가이드라인

관련 콘텐츠