1 Answer
Unfortunately no, I believe it's not currently supported, and the error message you saw is consistent with that.
I'd like to see the wording on this page (which says "Multi-model endpoints are not supported on GPU instance types.") expanded to make this clearer, since Inferentia accelerators aren't "GPUs" as such.
You could perhaps test CPU inference performance for MME serving of a large number of models, or move some of your higher-traffic models to dedicated single-model endpoints on Inferentia? A sketch of both setups follows below.
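For reference, here's roughly what both options look like with boto3. This is a minimal sketch, not a drop-in deployment: the role ARN, bucket paths, image URI, and all resource names are placeholders you'd swap for your own.

```python
import boto3

sm = boto3.client("sagemaker")
runtime = boto3.client("sagemaker-runtime")

ROLE_ARN = "arn:aws:iam::123456789012:role/SageMakerExecutionRole"  # placeholder
IMAGE_URI = "<your-inference-container-image>"                       # placeholder

# --- Option 1: multi-model endpoint (MME) on a CPU instance type ---
# All model artifacts live under one S3 prefix; SageMaker loads them on demand.
sm.create_model(
    ModelName="mme-cpu-model",
    ExecutionRoleArn=ROLE_ARN,
    PrimaryContainer={
        "Image": IMAGE_URI,
        "Mode": "MultiModel",                      # enables MME behavior
        "ModelDataUrl": "s3://my-bucket/models/",  # prefix holding *.tar.gz artifacts
    },
)
sm.create_endpoint_config(
    EndpointConfigName="mme-cpu-config",
    ProductionVariants=[{
        "VariantName": "AllTraffic",
        "ModelName": "mme-cpu-model",
        "InstanceType": "ml.c5.2xlarge",   # CPU instance type; MME is supported here
        "InitialInstanceCount": 1,
    }],
)
sm.create_endpoint(EndpointName="mme-cpu-endpoint",
                   EndpointConfigName="mme-cpu-config")

# Invoke a specific model on the MME by naming its artifact via TargetModel.
resp = runtime.invoke_endpoint(
    EndpointName="mme-cpu-endpoint",
    TargetModel="model-a.tar.gz",          # which artifact under the S3 prefix
    ContentType="application/json",
    Body=b'{"inputs": [1, 2, 3]}',
)

# --- Option 2: dedicated single-model endpoint on Inferentia ---
# One model per endpoint; the artifact must be compiled for Neuron and the
# container image must be Neuron-compatible.
sm.create_model(
    ModelName="hot-model-inf1",
    ExecutionRoleArn=ROLE_ARN,
    PrimaryContainer={
        "Image": IMAGE_URI,
        "ModelDataUrl": "s3://my-bucket/models/hot-model.tar.gz",
    },
)
sm.create_endpoint_config(
    EndpointConfigName="inf1-config",
    ProductionVariants=[{
        "VariantName": "AllTraffic",
        "ModelName": "hot-model-inf1",
        "InstanceType": "ml.inf1.xlarge",  # Inferentia; single-model endpoints only
        "InitialInstanceCount": 1,
    }],
)
sm.create_endpoint(EndpointName="hot-model-inf1-endpoint",
                   EndpointConfigName="inf1-config")
```

The trade-off is roughly: MME on CPU keeps many low-traffic models cheap behind one endpoint, while dedicated Inferentia endpoints reserve the accelerator throughput for the models that actually need it.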
What a shame. We handle many concurrent requests per second, and Inferentia instances were the best fit we found... Is there any instance type that can handle a similar workload without costing us a fortune?