Connecting EMR Studio with Git

Here is the error I get in EMR Studio Workspace.

"To link or unlink Git repositories, you must configure VPC network connections in the Edit Studio page"

I tried creating a Studio and configuring the VPC. But even though I created an interactive Studio, it does not let me create a new Workspace.

asked 2 years ago, 508 views
3 Answers

Hello,

Please check your CloudTrail API logs to find out which API call is failing. In general, you need to make sure the Studio service role has the necessary permissions.
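One way to follow this suggestion is to pull recent EMR events from CloudTrail and keep only those that record an error. The sketch below uses boto3's `lookup_events`. Filtering on the `elasticmapreduce.amazonaws.com` event source and the 24-hour window are assumptions, and a call made on the Studio's behalf may be logged under another service (for example EC2), so adjust the filter as needed.

```python
import json
from datetime import datetime, timedelta, timezone


def failed_calls(events):
    """Return (event name, error code, error message) for events that record an error."""
    failures = []
    for event in events:
        # Each lookup_events entry carries the full record as a JSON string.
        detail = json.loads(event["CloudTrailEvent"])
        if "errorCode" in detail:
            failures.append(
                (detail.get("eventName"), detail["errorCode"], detail.get("errorMessage", ""))
            )
    return failures


def recent_emr_events(hours=24, region_name=None):
    """Fetch recent EMR management events from CloudTrail (needs AWS credentials)."""
    import boto3  # imported lazily so failed_calls() works without AWS access

    client = boto3.client("cloudtrail", region_name=region_name)
    end = datetime.now(timezone.utc)
    pages = client.get_paginator("lookup_events").paginate(
        LookupAttributes=[
            # Assumption: the failing call is logged under the EMR event source.
            {"AttributeKey": "EventSource", "AttributeValue": "elasticmapreduce.amazonaws.com"}
        ],
        StartTime=end - timedelta(hours=hours),
        EndTime=end,
    )
    return [event for page in pages for event in page["Events"]]


if __name__ == "__main__":
    for name, code, message in failed_calls(recent_emr_events()):
        print(name, code, message)
```

An `AccessDenied` style error code against the Studio service role would point to the missing permission mentioned above.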

Also, please check the VPC requirements for private subnets.

(i) Your VPC should have at least one public subnet before you create a private subnet.

(ii) If a subnet's route table has no route to an internet gateway, the subnet is a private subnet.
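To check which of your subnets are public or private in this sense, you can inspect the route tables returned by EC2's `describe_route_tables`. The following is a minimal sketch, assuming `route_tables` is the `RouteTables` list for a single VPC; the function names are illustrative, not part of any AWS API.

```python
def effective_route_table(route_tables, subnet_id):
    """Return the route table a subnet uses: its explicit association, else the VPC main table."""
    main = None
    for table in route_tables:
        for assoc in table.get("Associations", []):
            if assoc.get("SubnetId") == subnet_id:
                return table
            if assoc.get("Main"):
                main = table
    return main


def is_public_subnet(route_tables, subnet_id):
    """A subnet is treated as public if its route table has a route to an internet gateway."""
    table = effective_route_table(route_tables, subnet_id)
    if table is None:
        return False
    return any((route.get("GatewayId") or "").startswith("igw-") for route in table.get("Routes", []))


def has_nat_route(route_tables, subnet_id):
    """True if the subnet's route table has a route through a NAT gateway."""
    table = effective_route_table(route_tables, subnet_id)
    if table is None:
        return False
    return any("NatGatewayId" in route for route in table.get("Routes", []))


def load_route_tables(vpc_id, region_name=None):
    """Fetch the route tables of one VPC (needs AWS credentials)."""
    import boto3  # imported lazily so the helpers above work offline

    ec2 = boto3.client("ec2", region_name=region_name)
    response = ec2.describe_route_tables(Filters=[{"Name": "vpc-id", "Values": [vpc_id]}])
    return response["RouteTables"]
```

A private subnet that also reports `has_nat_route(...)` as true can still reach the internet for outbound traffic, which matters for reaching a Git host.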

AWS
SUPPORT ENGINEER
answered 2 years ago

Can you please describe the error you saw while creating the new Workspace?

answered 2 years ago

To connect EMR Studio with Git, you can follow these steps:

1. Ensure prerequisites are met: Your EMR Studio must be configured to allow Git repository linking. You need appropriate permissions to access Git repositories. EMR clusters must have internet access through a NAT gateway or virtual private gateway.

2. Open your EMR Studio Workspace: Navigate to the Workspaces list in your EMR Studio. Select the Workspace you want to link to a Git repository.

3. Access the Git repository tool: In the left sidebar, click on the "Amazon EMR Git Repository" icon. This will open the Git repository tool panel.

4. Add a new Git repository: Click on "Add new Git repository" in the tool panel. Provide a descriptive name for the repository (use only alphanumeric characters, hyphens, and underscores). Enter the Git repository URL. Specify the branch name you want to check out.

5. Set up Git credentials: Choose how to manage Git credentials using AWS Secrets Manager. You can create a new secret or use an existing one. For username/password authentication, enter the credentials. For personal access token (PAT) authentication, enter your token.

6. Link the repository: After adding the repository, select it from the dropdown list to link it to your Workspace. Wait for the linking process to complete.

7. Use Git in your Workspace: The linked repository will appear in the File Browser panel. You can use Git commands through a terminal while connected to the cluster's primary node. Alternatively, use the jupyterlab-git extension for a graphical interface.

Remember to follow security best practices when managing Git credentials and repositories. Always use the principle of least privilege when setting up permissions. For the most up-to-date and detailed instructions, please refer to the official AWS documentation on EMR Studio and Git integration.
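The naming rule for a new repository (alphanumeric characters, hyphens, and underscores only) can be checked before you fill in the form. This is a small sketch of that rule as stated above; the function name is hypothetical, and any length limit the console enforces is not covered here.

```python
import re

# Letters, digits, hyphens, and underscores only, as described in the steps above.
_REPO_NAME = re.compile(r"[A-Za-z0-9_-]+")


def is_valid_repo_name(name):
    """Return True if the name uses only the allowed characters and is not empty."""
    return bool(_REPO_NAME.fullmatch(name))


if __name__ == "__main__":
    for candidate in ("analytics-notebooks_v2", "my repo", "repo.git"):
        print(candidate, is_valid_repo_name(candidate))
```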

answered a year ago
