1 Answer
- Newest
- Most votes
- Most comments
0
To deploy a model on Inf1 and Inf2 instances, you need to compile the model using AWS Neuron. In this documentation page you will find the updated list of Supported models for AWS Inferentia2, AWS Inferentia, and also AWS Trainium.
If you want to deploy Stable Diffusion on AWS Inferentia2, please see this blogpost for a full walkthrough.
Hope this helps.
answered 8 months ago
Relevant content
- asked a year ago
- AWS OFFICIALUpdated 4 months ago
- AWS OFFICIALUpdated 8 months ago
- AWS OFFICIALUpdated a year ago
- AWS OFFICIALUpdated 3 months ago