SageMaker training job is not stopping

Hi,

Greetings!!

I found some errors in the CloudWatch logs while running a training job via SageMaker Pipelines, but the training job did not fail. I therefore tried to stop the job using the boto3 APIs below as well as the AWS CLI, but it has been stuck in the Stopping status for a long time and never stops.

stop_pipeline_execution()

stop_training_job()

aws sagemaker stop-training-job --training-job-name <value>

How can I forcefully kill the training job?

Thanks

asked 2 years ago, 772 views
1 Answer

Hello, did you notice any new or different error messages in CloudWatch after attempting to stop the training job?
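To help gather that detail, here is a minimal sketch of requesting a stop and then polling the job's status. It uses the boto3 SageMaker client calls `stop_training_job` and `describe_training_job`. The helper name `stop_and_wait`, its timeout and poll defaults, and the job name in the usage comment are assumptions made for illustration, not part of the original question. It does not force termination; it only shows whether the job is still reported as Stopping and which secondary status it reports.

```python
import time


def stop_and_wait(sm_client, job_name, timeout_s=600, poll_s=15, sleep=time.sleep):
    """Request a stop if the job is running, then poll its status.

    sm_client is expected to be a boto3 SageMaker client.
    Returns (TrainingJobStatus, SecondaryStatus) once the job reaches a
    terminal state, or the last values seen if timeout_s elapses first.
    """
    desc = sm_client.describe_training_job(TrainingJobName=job_name)
    # Only send a stop request while the job is still running; a job that
    # is already Stopping has received one.
    if desc["TrainingJobStatus"] == "InProgress":
        sm_client.stop_training_job(TrainingJobName=job_name)

    waited = 0
    while True:
        desc = sm_client.describe_training_job(TrainingJobName=job_name)
        status = desc["TrainingJobStatus"]
        secondary = desc.get("SecondaryStatus")
        if status in ("Stopped", "Completed", "Failed"):
            return status, secondary
        if waited >= timeout_s:
            # Still InProgress or Stopping: report what was last seen.
            return status, secondary
        sleep(poll_s)
        waited += poll_s


# Usage (hypothetical job name; requires boto3 and AWS credentials):
#   import boto3
#   print(stop_and_wait(boto3.client("sagemaker"), "my-training-job"))
```

If the helper still returns Stopping after the timeout, the returned secondary status together with the CloudWatch logs is the information worth sharing here or with AWS Support.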

AWS
answered 2 years ago
