SageMaker training job is not stopping


Hi,

Greetings!!

I found some errors in the CloudWatch logs while running a training job via SageMaker Pipelines, but the training job did not fail. I therefore tried to stop the training job using the boto3 APIs below and the AWS CLI, but the job has been in the Stopping status for a long time and never actually stops.

stop_pipeline_execution()

stop_training_job()

aws sagemaker stop-training-job --training-job-name <value>

How can I forcefully kill training jobs?

Thanks

Asked 2 years ago, 772 views
1 Answer

Hello, did you notice any different error message in CloudWatch after attempting to stop the training job?
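
If it helps with checking, here is a minimal sketch of how the stop request and the resulting status could be inspected from Python. It assumes a boto3 SageMaker client is passed in and uses only `describe_training_job` and `stop_training_job`; the helper name `stop_and_wait`, the timeout, and the polling interval are hypothetical choices for illustration. It does not force termination, it only reports what the service says about the job.

```python
import time

# Statuses after which the training job will not change any further.
TERMINAL_STATUSES = {"Completed", "Failed", "Stopped"}


def stop_and_wait(client, job_name, timeout_s=600, poll_s=15, sleep=time.sleep):
    """Request a stop if the job is still running, then poll its status.

    `client` is assumed to be a boto3 SageMaker client. Returns the last
    (TrainingJobStatus, SecondaryStatus) pair seen, either when the job
    reaches a terminal status or when the (hypothetical) timeout expires.
    """
    desc = client.describe_training_job(TrainingJobName=job_name)
    # Only send the stop request while the job is still running; a job that
    # is already Stopping has received one.
    if desc["TrainingJobStatus"] == "InProgress":
        client.stop_training_job(TrainingJobName=job_name)

    waited = 0
    while True:
        desc = client.describe_training_job(TrainingJobName=job_name)
        status = desc["TrainingJobStatus"]
        secondary = desc.get("SecondaryStatus")
        if status in TERMINAL_STATUSES or waited >= timeout_s:
            return status, secondary
        sleep(poll_s)
        waited += poll_s
```

With real credentials this would be called as `stop_and_wait(boto3.client("sagemaker"), "<training-job-name>")`. If it returns `Stopping` after the timeout, the returned secondary status, together with any new CloudWatch messages, is the information worth sharing here.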

AWS
Answered 2 years ago
