All Content tagged with AWS Neuron
AWS Neuron is a software development kit (SDK) for running machine learning inference using AWS Inferentia chips.
Content language: English
Select up to 5 tags to filter
Sort by most recent
EXPERT
published 20 days ago1 votes74 views
Please, guide me how to install neuron runtime on ubuntu 22.04?
I always get cannot locate collectives after running this
"sudo apt-get install aws-neuronx-collectives=2.* -y"
I have WSL Ubuntu...
EXPERT
published 3 months ago3 votes1076 views
EXPERT
published 7 months ago1 votes1051 views
Hello, i'm using **pytorch-inference:2.0.1-gpu-py310-cu118-ubuntu20.04-sagemaker** AMI and following this guide...
hello,
i'm using **pytorch-inference:2.0.1-gpu-py310-cu118-ubuntu20.04-sagemaker** image to run the service i'd like to switch to INF2 instance.
~~I think i can try to use...
Hi,
One big difference I've noticed between the inf1 and inf2 development experience is that with inf2, we are not able to compile models outside of inf2 instances (unlike inf1 where we were able to...
Which version of torch_neuronx is used in the example?
This is asked because neither the API Reference at [aws...
I am trying to run inference of GPT2 on inf2 instance using this transformers-neuronx example:
[https://github.com/aws-neuron/transformers-neuronx#hugging-face-generate-api-support]()
I keep getting...
I am currently facing an issue with the AWS Neuron SDK when trying to run the PyTorch example provided in the AWS Neuron GitHub repository on a Deep Learning AMI Neuron PyTorch 1.13 (Ubuntu 20.04)...
Pytorch 2.0.1 is backward compatible so it shall be fine to use it but when we tried to we got this error.
ImportError
Invalid dependency version torch==2.0.1+cu117. Expected torch==1.13.1
```
#...
The audio going into the model would be different lengths, and I saw a post saying "At the time of writing, the AWS Neuron SDK does not support dynamic shapes, which means that the input size needs to...