All Content tagged with AWS Inferentia
AWS Inferentia is designed to provide high performance inference in the cloud, to drive down the total cost of inference, and to make it easy for developers to integrate machine learning into their business applications.
Content language: English
Filter content
Select tags to filter
Sort by
Sort by most recent
57 results
Markus AdhiwiyogoEXPERT
published 7 months ago0 votes197 views
New Training & Certification Badge from AWS
Markus AdhiwiyogoEXPERT
published 10 months ago0 votes159 views
Virtual training on choosing the optimal infrastructure for Small Language Models
[AWS Neuron Documentation](https://awsdocs-neuron.readthedocs-hosted.com/en/latest/general/setup/neuron-setup/multiframework/multi-framework-ubuntu22-neuron-dlami.html#setup-ubuntu22-multi-framework-d...
1
answers
0
votes
168
views
asked a year ago
Burtoft, JimEXPERT
published a year ago1 votes457 views
Walk through the options for compiling a model for inference using Inferentia or Trainium. You would need to do this if the model or the configuration you want isn't available in the Hugging Face cac...
KamranEXPERT
published a year ago3 votes5.4K views
Step by Step guide to deploy DeepSeek R1 Distilled models.
Burtoft, JimEXPERT
published a year ago0 votes559 views
A list of resources to use when you are first starting with the Neuron SDK and Inferentia or Trainium instances.
Burtoft, JimEXPERT
published a year ago0 votes1.2K views
Steps to set up Jupyter notebooks and VS Code remote server on Trainium and Inferentia Neuron systems.
KamranEXPERT
published a year ago0 votes1.4K views
Key announcements and discover how industry leaders, like Apple and Anthropic, are revolutionizing AI with AWS Trainium and Inferentia
Burtoft, JimEXPERT
published a year ago0 votes1.3K views
Get started with Inferentia and Trainium on EC2 using the Hugging Face Neuron Deep Learning Amazon Machine Image (AMI). A short walkthrough of how to deploy an EC2 image with all the Neuron drivers a...
KamranEXPERT
published 2 years ago0 votes1.2K views
Are you heading to **AWS re:Invent 2024** and looking for AWS Inferentia and Trainium sessions to take your machine learning skills to the next level?
Burtoft, JimEXPERT
published 2 years ago1 votes2.9K views
See what regions have instances, and find out how to generate your own list with a python script.
Hello AWS team!
I am trying to run a suite of inference recommendation jobs leveraging NVIDIA Triton Inference Server on a set of GPU instances (ml.g5.12xlarge, ml.g5.8xlarge, ml.g5.16xlarge) as well...
1
answers
0
votes
946
views
asked 2 years ago