All Content tagged with AWS Neuron

AWS Neuron is a software development kit (SDK) for running machine learning inference using AWS Inferentia chips.

Content language: English

Select up to 5 tags to filter
Sort by most recent
Please, guide me how to install neuron runtime on ubuntu 22.04? I always get cannot locate collectives after running this "sudo apt-get install aws-neuronx-collectives=2.* -y" I have WSL Ubuntu...
1
answers
0
votes
280
views
Zhenya
asked 3 months ago
Hello, i'm using **pytorch-inference:2.0.1-gpu-py310-cu118-ubuntu20.04-sagemaker** AMI and following this guide...
1
answers
0
votes
588
views
Ana
asked 10 months ago
hello, i'm using **pytorch-inference:2.0.1-gpu-py310-cu118-ubuntu20.04-sagemaker** image to run the service i'd like to switch to INF2 instance. ~~I think i can try to use...
1
answers
0
votes
662
views
Ana
asked a year ago
Hi, One big difference I've noticed between the inf1 and inf2 development experience is that with inf2, we are not able to compile models outside of inf2 instances (unlike inf1 where we were able to...
2
answers
0
votes
323
views
asked a year ago
Which version of torch_neuronx is used in the example? This is asked because neither the API Reference at [aws...
2
answers
0
votes
375
views
asked a year ago
I am trying to run inference of GPT2 on inf2 instance using this transformers-neuronx example: [https://github.com/aws-neuron/transformers-neuronx#hugging-face-generate-api-support]() I keep getting...
1
answers
0
votes
353
views
ria
asked a year ago
I am currently facing an issue with the AWS Neuron SDK when trying to run the PyTorch example provided in the AWS Neuron GitHub repository on a Deep Learning AMI Neuron PyTorch 1.13 (Ubuntu 20.04)...
1
answers
0
votes
562
views
IE32
asked a year ago
Pytorch 2.0.1 is backward compatible so it shall be fine to use it but when we tried to we got this error. ImportError Invalid dependency version torch==2.0.1+cu117. Expected torch==1.13.1 ``` #...
1
answers
0
votes
564
views
asked a year ago
The audio going into the model would be different lengths, and I saw a post saying "At the time of writing, the AWS Neuron SDK does not support dynamic shapes, which means that the input size needs to...
1
answers
0
votes
305
views
asked a year ago