SageMaker Neo Compilation - Unable to Neo Compile for FP16 and INT8 precision


I'm trying to Neo compile a PyTorch YOLOv5 Large model for edge deployment on an NVIDIA Jetson Xavier NX device. I'm able to do it with the default settings for FP32 precision, but not for FP16 or INT8. I have tried passing the precision via "CompilerOptions" in the OutputConfig, but the output of Neo compilation is still FP32.

How can I get a Neo compiled model in FP16 or INT8 precision? Does Neo support these precision modes or not?
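For reference, the kind of compilation request described above might look like the following sketch (boto3-style). The bucket paths, role ARN, and the `"dtype"` entry in `CompilerOptions` are assumptions for illustration; as the answer notes, Neo emits FP32 for Jetson targets regardless of this hint.

```python
import json

def build_compilation_request(job_name, role_arn, model_s3_uri, output_s3_uri):
    """Build a SageMaker Neo create_compilation_job request for a
    PyTorch YOLOv5-Large model targeting a Jetson Xavier device.
    The "dtype" compiler option is an assumed precision hint; on
    Jetson targets the compiled output remains FP32 either way."""
    return {
        "CompilationJobName": job_name,
        "RoleArn": role_arn,  # hypothetical IAM role with SageMaker permissions
        "InputConfig": {
            "S3Uri": model_s3_uri,
            # YOLOv5-Large default input shape: batch 1, 3 channels, 640x640
            "DataInputConfig": json.dumps({"input0": [1, 3, 640, 640]}),
            "Framework": "PYTORCH",
        },
        "OutputConfig": {
            "S3OutputLocation": output_s3_uri,
            "TargetDevice": "jetson_xavier",
            # Precision request -- ignored for Jetson targets (output stays FP32)
            "CompilerOptions": json.dumps({"dtype": "float16"}),
        },
        "StoppingCondition": {"MaxRuntimeInSeconds": 900},
    }

# The resulting dict would be passed to the API as:
#   boto3.client("sagemaker").create_compilation_job(**request)
request = build_compilation_request(
    "yolov5l-neo-fp16",
    "arn:aws:iam::123456789012:role/NeoRole",  # hypothetical role ARN
    "s3://my-bucket/yolov5l.tar.gz",           # hypothetical model artifact
    "s3://my-bucket/neo-output/",
)
```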

Asked 2 years ago · 240 views
1 Answer

Unfortunately, Neo doesn't support quantization for Jetson devices. That means you can only compile FP32 models, and they remain FP32 after compilation.

I know this is not what you're looking for, but FYI: Neo supports INT8 model optimization only for TFLite models, targeting CPU rather than GPU. You can check some supported models here: https://docs.amazonaws.cn/en_us/sagemaker/latest/dg/neo-supported-edge-tested-models.html
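To illustrate the CPU-only INT8 path mentioned above, a compilation request for a pre-quantized TFLite model might be sketched like this. The input shape, paths, and choice of ARM64 Linux as the target platform are assumptions, not values from the thread.

```python
import json

def build_tflite_int8_request(job_name, role_arn, model_s3_uri, output_s3_uri):
    """Sketch of a Neo compilation request for a pre-quantized INT8
    TFLite model targeting an ARM64 CPU platform (not a GPU).
    Paths, role ARN, and input shape are hypothetical."""
    return {
        "CompilationJobName": job_name,
        "RoleArn": role_arn,
        "InputConfig": {
            "S3Uri": model_s3_uri,
            # Assumed TFLite image-classifier input: NHWC, 224x224 RGB
            "DataInputConfig": json.dumps({"input": [1, 224, 224, 3]}),
            "Framework": "TFLITE",
        },
        "OutputConfig": {
            "S3OutputLocation": output_s3_uri,
            # CPU target platform -- Neo's INT8 support is CPU-only
            "TargetPlatform": {"Os": "LINUX", "Arch": "ARM64"},
        },
        "StoppingCondition": {"MaxRuntimeInSeconds": 900},
    }
```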

AWS
Answered 2 years ago
