SageMaker training job is not stopping

0

Hi,

Greetings!!

I found some errors in cloud watch logs while executing a training job via SageMake pipelines but unfortunately training job did not fail. Hence I tried stopping a training job using boto3 APIs below and AWS CLI as well but training job is in stopping status for a long time and it's not stopping.

stop_pipeline_execution()

stop_training_job()

stop-training-job --training-job-name <value>

How to kill the training jobs forcefully?

Thanks

posta 2 anni fa772 visualizzazioni
1 Risposta
0

Hello, did you notice any different error message on Cloudwatch after attempting to stop the training job ?

AWS
con risposta 2 anni fa

Accesso non effettuato. Accedi per postare una risposta.

Una buona risposta soddisfa chiaramente la domanda, fornisce un feedback costruttivo e incoraggia la crescita professionale del richiedente.

Linee guida per rispondere alle domande