[Feature Request] Serverless Inference with VPC Config

1

I would like to use a Sagemaker Model with a custom VPC Configuration, which is currently not possible with Serverless Inference. Is this feature planned? More generally: Is there a roadmap somewhere for Serverless Inference?

Richard
已提問 2 年前檢視次數 843 次
3 個答案
0

Any updates on this?

Btw, you should add a warning to the documentation of Model.deploy() here: https://sagemaker.readthedocs.io/en/v2.169.0/api/inference/model.html as I've been getting a ValidationException and I've been trying to debug it for hours, without a clue for why it's failing. I am also using a VPC config. Honestly AWS, fix your damn documentation.

mpw
已回答 10 個月前
  • Hi! We also lost several hours because of incomplete and misleading documentation. What we ended up doing is to create a VPC network for our Redis Cluster which Sagemaker was not part of and handled the caching in the API calling the Sagemaker endpoint. Maybe you can use a similar approach.

0

SageMaker Serverless Inference is currently in preview and VPC support is not available but as the feature you are asking for is an important one and is on the roadmap( unfortunately I cannot share the exact details of the timelines here)

AWS
已回答 2 年前
0

How long will this be on preview. I hope when it comes out, it will have support for VPC

已回答 1 年前

您尚未登入。 登入 去張貼答案。

一個好的回答可以清楚地回答問題並提供建設性的意見回饋,同時有助於提問者的專業成長。

回答問題指南