Can we host AWS JumpStart Foundation models directly on AWS Inf1 or Inf2 Instances ?

0

As the title says, we can host LLM's and Stable diffusion models from jumpstart directly on SageMaker Inf1 or Inf2 chips ?

I tried doing that with Stable Diffusion 2 Model (i.e from studio notebook of AWS JumpStart Stable Diffusion, selected instance type as one of the AWS Inf chip). The endpoint got hosted as well but later failed at invoke endpoint step.

質問済み 9ヶ月前430ビュー
1回答
0

To deploy a model on Inf1 and Inf2 instances, you need to compile the model using AWS Neuron. In this documentation page you will find the updated list of Supported models for AWS Inferentia2, AWS Inferentia, and also AWS Trainium.

If you want to deploy Stable Diffusion on AWS Inferentia2, please see this blogpost for a full walkthrough.

Hope this helps.

profile pictureAWS
jnavrro
回答済み 9ヶ月前

ログインしていません。 ログイン 回答を投稿する。

優れた回答とは、質問に明確に答え、建設的なフィードバックを提供し、質問者の専門分野におけるスキルの向上を促すものです。

質問に答えるためのガイドライン

関連するコンテンツ