SageMaker: Deploying a Keras "model.h5" model to a SageMaker inference endpoint


Hello everyone! I'm trying to deploy an emotion recognition model saved as a Keras `model.h5` file, but nothing I've tried so far has worked. First I converted the model with `tf.saved_model.save`, which produced this structure:

```
saved_model/
├── assets/
├── variables/
│   ├── variables.data-00000-of-00001
│   └── variables.index
└── saved_model.pb
```
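For reference, this is roughly the conversion step I ran (a minimal sketch; the tiny stand-in model is only there so the snippet runs end to end, while the real workflow loads the actual trained `model.h5`):

```python
import tensorflow as tf

# Tiny stand-in model so this sketch is self-contained; in the real
# workflow, "model.h5" is the trained emotion recognition model.
model = tf.keras.Sequential([tf.keras.Input(shape=(4,)), tf.keras.layers.Dense(1)])
model.save("model.h5")

# The actual conversion step: load the .h5 file and export SavedModel format.
restored = tf.keras.models.load_model("model.h5")
tf.saved_model.save(restored, "saved_model")
```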

Then I packaged it into `model.tar.gz` with this layout:

```
model.tar.gz
└── 1/
    ├── assets/
    ├── variables/
    │   ├── variables.data-00000-of-00001
    │   └── variables.index
    └── saved_model.pb
```
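This is roughly how I built the archive (a sketch with dummy files standing in for the real SavedModel export; the point is that `1/` must be the top-level entry inside the tarball):

```shell
# Dummy files standing in for the real tf.saved_model.save output,
# so this sketch runs end to end.
mkdir -p saved_model/assets saved_model/variables
touch saved_model/saved_model.pb \
      saved_model/variables/variables.index \
      saved_model/variables/variables.data-00000-of-00001

# TensorFlow Serving expects <version>/saved_model.pb at the top of the
# archive, so "1/" has to be the first path component.
mkdir -p export
cp -r saved_model export/1
tar -C export -czf model.tar.gz 1

tar -tzf model.tar.gz   # should list 1/saved_model.pb, 1/variables/, ...
```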

But deployment still failed. This is the code I used:

```python
from sagemaker.tensorflow import TensorFlowModel

model = TensorFlowModel(model_data='s3://BUCKET/my-model-1.tar.gz',
                        role=role,
                        framework_version='2.4')

predictor = model.deploy(initial_instance_count=1, instance_type='ml.m4.xlarge')
```

I got this error:

```
UnexpectedStatusException: Error hosting endpoint tensorflow-inference-2023-09-21.....: Failed. Reason: The primary container for production variant AllTraffic did not pass the ping health check. Please check CloudWatch logs for this endpoint.
```

Checking the CloudWatch logs, I see this:

```
Traceback (most recent call last):
  File "/sagemaker/serve.py", line 444, in <module>
    ServiceManager().start()
  File "/sagemaker/serve.py", line 424, in start
    self._create_tfs_config()
  File "/sagemaker/serve.py", line 128, in _create_tfs_config
    raise ValueError("no SavedModel bundles found!")
```
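As far as I can tell, that error means the serving container could not find `saved_model.pb` under a numeric version directory at the top of the extracted archive. One easy way this happens is an extra wrapper directory above `1/` (illustrative repro below; the paths are made up):

```shell
# Wrongly packaged archive: tarring from one directory too high adds a
# "model/" wrapper above the version directory.
mkdir -p bad/model/1
touch bad/model/1/saved_model.pb
tar -C bad -czf bad.tar.gz model

# The listing starts with model/1/... instead of 1/..., which is the kind
# of layout TF Serving rejects with "no SavedModel bundles found!".
tar -tzf bad.tar.gz
```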

Would appreciate any help!
Thanks