cannot Launch instance (kernel) with custom Sagemaker Image

1

The custom image has been used for a long time.

For some reason, when I created new instance by using custom image from sagemaker studio, I keep get "Failed to start kernel" error.

Same image could work fine in the past.

Also I noticed that if failed first, and try with other instance, it could work. Some times if you kept trying on same instance there was a chance worked as well.

I hope sagemaker team aware of this issue.

已提问 9 个月前324 查看次数
4 回答
1

I'm experiencing the same error. My image has also been used for months without any problems. It started yesterday.

Also, cloudwatch logs don't show any errors.

As a temporary fix, you can start an instance with a default kernel (data science 3.0) and then switch to your custom image.

Gonzalo
已回答 9 个月前
  • Thanks, this works :)

    But for some instance type (by default, only allow 1 running app per domain), I cannot use this way to switch because after I created one by default kernel, it won't let you create second one with custom image.

    Hope Sagemaker team notice this issue and implement the fix asap.

1

Enter image description here

已回答 9 个月前
0
已接受的回答

I have been worked with Sagemaker support and they implemented the changes, the instance should be created correctly with custom images.

during the issue, @Gonzalo's temp solution was working perfectly

已回答 9 个月前
0

I also tested running the notebook from "notebook job". Looks like the kernel from job scheduler can start kernel without custom image without error.

已回答 9 个月前

您未登录。 登录 发布回答。

一个好的回答可以清楚地解答问题和提供建设性反馈,并能促进提问者的职业发展。

回答问题的准则