Unable to load trained model in Sagemaker

0

I have trained a few models in sagemaker however I am unable to load them for prediction.

I am picking model details from: Sagemaker > Inference > Models > Container 1 section: Image_uri = value in image model_data = Value in model data location

then passing these values into sagemaker Model function.

When I deploy this model, it gives error: ping health check failed for AllTraffic production variant. This error doesn't come when I train a new model and deploy it.

1 Risposta
0

The cause for issues like this are due to a mismatch between the base model between the training and inference endpoints. A solution similar to below would help resolve your issue.

Github repo : https://github.com/marshmellow77/sm-extend-container/blob/main/02_extend_container.ipynb. talks about how to extend the existing Hugging Face DLCs by pulling them from the public ECR and running a simple Dockerfile on top of them that will install the latest available version of transformers.

AWS
marancs
con risposta un anno fa

Accesso non effettuato. Accedi per postare una risposta.

Una buona risposta soddisfa chiaramente la domanda, fornisce un feedback costruttivo e incoraggia la crescita professionale del richiedente.

Linee guida per rispondere alle domande