Sagemaker endpoint does not use latest approved model version

0

I'm trying to troubleshoot why the sagemaker staging endpoint is not using the latest model version. I did the following steps:

  1. I started a new Sagemaker project and used the build, train, deploy and monitor model template.
  2. The pipeline automatically executes upon creating the project and I approved that model. The endpoint was successfully created using this model version 1.
  3. I cloned the repo, made a inconsequential change (just changed some training hyper params) and pushed the changes. Pipeline executes again successfully creating model version 2.
  4. I approved the version 2 and deploy script also runs successfully it seems based on the deploy logs but the endpoint keeps using version 1 of the model. (Also for some reason version 1 of the model appears as "Model-xxxxxxx" in the Models list while version 2 appears as "Pipeline-xxxxxx-xxxxxx"

Would appreciate any help. Thank you!

  • Did you approve the transition to "production" in CodeBuild for the first model? In the deploy script, in the function that retrieves the latest approved model, can you print which model it finds and verify that it's finding version 2? Additionally, the deployment script prints out cloudformation templates, did you verify that it shows version 2? If you can paste the deploy logs here that would be helpful.

Keine Antworten

Du bist nicht angemeldet. Anmelden um eine Antwort zu veröffentlichen.

Eine gute Antwort beantwortet die Frage klar, gibt konstruktives Feedback und fördert die berufliche Weiterentwicklung des Fragenstellers.

Richtlinien für die Beantwortung von Fragen