Error Using Local GPU Mode in SageMaker Notebook


Hello,

I'm currently receiving an error when fitting an estimator in local GPU mode. The notebook runs on an ml.p3.2xlarge, and instance_type is set to "local_gpu".

from sagemaker.pytorch import PyTorch

env = {
    'SAGEMAKER_REQUIREMENTS': 'requirements.txt',  # path relative to `source_dir` below.
}

# `role` is defined earlier in the notebook.
estimator = PyTorch(
    entry_point="train.py",
    role=role,
    py_version="py3",
    framework_version="1.7.1",  # "1.8.0",
    instance_count=1,
    instance_type=instance_type,  # "local_gpu" in this run
    hyperparameters={
        "n_classes": 3
    },
)

This is the error. I've been troubleshooting and have even referenced the official GitHub examples.

 Command '['docker', 'pull', '763104351884.dkr.ecr.us-east-1.amazonaws.com/pytorch-training:1.7.1-gpu-py3']' returned non-zero exit status 1.

Any help would be appreciated.

Samuel
Asked 8 months ago, 318 views
1 Answer

Hello.

The same problem has been reported as a bug in the issue below.
It has probably not been resolved yet.
https://github.com/aws/amazon-sagemaker-examples/issues/3110
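
In the meantime, it may help to see why the pull itself fails, since the message above only reports the exit status. Below is a minimal sketch that re-runs the same `docker pull` from a notebook cell and returns Docker's own output. The image URI is copied from the error in the question; `run_and_capture` is a hypothetical helper name, not part of the SageMaker SDK, and the sketch assumes Python 3.7 or later.

```python
import subprocess

# Image URI copied verbatim from the error message in the question.
IMAGE_URI = "763104351884.dkr.ecr.us-east-1.amazonaws.com/pytorch-training:1.7.1-gpu-py3"
PULL_CMD = ["docker", "pull", IMAGE_URI]


def run_and_capture(cmd):
    """Run `cmd` and return (exit_code, combined stdout/stderr text).

    Hypothetical helper for illustration only.
    """
    try:
        result = subprocess.run(cmd, capture_output=True, text=True)
    except FileNotFoundError as exc:
        # The executable (e.g. docker) is not on PATH; 127 mirrors the
        # shell convention for "command not found".
        return 127, str(exc)
    return result.returncode, (result.stdout + result.stderr).strip()
```

Calling `run_and_capture(PULL_CMD)` and printing the second element of the result should show the message Docker printed when the pull failed, which is usually more specific than "returned non-zero exit status 1".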

Expert
Answered 8 months ago