SageMaker training job is not stopping


Hi,

Greetings!!

I found some errors in the CloudWatch logs while running a training job via SageMaker Pipelines, but the training job did not fail. I therefore tried to stop the training job using the boto3 APIs below and the AWS CLI, but the job has been in the Stopping status for a long time and is not stopping.

stop_pipeline_execution()

stop_training_job()

aws sagemaker stop-training-job --training-job-name <value>
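For reference, the stop-and-check sequence described above could be sketched roughly as follows. This is a minimal sketch, not a force-kill: it only uses the boto3 SageMaker calls stop_training_job and describe_training_job, and it assumes a stop request is only worth sending while the job is still InProgress. The helper name stop_and_wait, the timeout, and the polling interval are hypothetical and chosen only for illustration.

```python
import time

# Statuses after which the job will not change state any more.
TERMINAL_STATUSES = {"Completed", "Failed", "Stopped"}


def stop_and_wait(client, job_name, timeout_s=600, poll_s=15, sleep=time.sleep):
    """Request a stop if needed, then poll until the job is terminal or we time out.

    `client` is expected to behave like boto3.client("sagemaker").
    Returns the last TrainingJobStatus that was observed.
    """
    status = client.describe_training_job(TrainingJobName=job_name)["TrainingJobStatus"]

    # Assumption: only send the stop request while the job is still running;
    # a job that is already Stopping has received the request.
    if status == "InProgress":
        client.stop_training_job(TrainingJobName=job_name)

    waited = 0
    while True:
        status = client.describe_training_job(TrainingJobName=job_name)["TrainingJobStatus"]
        if status in TERMINAL_STATUSES or waited >= timeout_s:
            return status
        sleep(poll_s)
        waited += poll_s
```

With real credentials this would be called as stop_and_wait(boto3.client("sagemaker"), "<training-job-name>"). If it returns "Stopping" after the timeout, the job is stuck in the state described in this question.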

How can I forcefully kill the training job?

Thanks

asked 2 years ago, 772 views
1 Answer

Hello, did you notice any different error message in CloudWatch after attempting to stop the training job?
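One way to check this is to pull the job's log events written after the stop request. The sketch below assumes the training job logs are in the standard /aws/sagemaker/TrainingJobs log group with stream names that start with the job name; the helper name recent_job_log_messages and its parameters are hypothetical.

```python
def recent_job_log_messages(logs_client, job_name, since_ms, limit=50):
    """Return log messages for a training job written at or after `since_ms`.

    `logs_client` is expected to behave like boto3.client("logs");
    `since_ms` is a Unix timestamp in milliseconds, e.g. the time of the stop request.
    Only the first page of results is read in this sketch.
    """
    response = logs_client.filter_log_events(
        logGroupName="/aws/sagemaker/TrainingJobs",  # assumed default log group
        logStreamNamePrefix=job_name,                # streams are prefixed with the job name
        startTime=since_ms,
        limit=limit,
    )
    return [event["message"] for event in response.get("events", [])]
```

With real credentials this would be called as recent_job_log_messages(boto3.client("logs"), "<training-job-name>", since_ms).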

AWS
answered 2 years ago
