Questions tagged with AWS Inferentia
Hi,
Is there more documentation (or are there more examples) for *TensorFlow* on Trn1/Trn1n instances?
Documentation at:
https://awsdocs-neuron.readthedocs-hosted.com/en/latest/frameworks/tensorflow/index.html ...
2 answers · 0 votes · 292 views · asked a month ago
We are using tensorflow.neuron to compile a tensorflow 1.x SavedModel to run on AWS Inferentia machines on EC2. We do this by calling:
tensorflow.neuron.saved_model.compile(model_dir,...
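For context, a minimal sketch of that compile step, assuming the Neuron SDK's TF 1.x API; the paths, batch size, and the `compile_kwargs` helper are hypothetical illustrations, and the actual Neuron call is shown commented because it needs an Inf1 instance with `tensorflow-neuron` installed:

```python
# Hypothetical input and output paths for the SavedModel.
MODEL_DIR = "my_saved_model/"
COMPILED_DIR = "my_saved_model_neuron/"

def compile_kwargs(model_dir, compiled_dir, batch_size=1):
    """Collect the arguments passed to tensorflow.neuron.saved_model.compile
    (argument names follow my reading of the Neuron docs)."""
    return {
        "model_dir": model_dir,        # source TF 1.x SavedModel
        "new_model_dir": compiled_dir, # where the Neuron-compiled model lands
        "batch_size": batch_size,      # batch size to compile for
    }

# On an Inf1 instance with the Neuron SDK installed:
# import tensorflow.neuron as tfn
# tfn.saved_model.compile(**compile_kwargs(MODEL_DIR, COMPILED_DIR))
print(compile_kwargs(MODEL_DIR, COMPILED_DIR))
```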
3 answers · 0 votes · 408 views · asked 2 months ago
Currently, I host my model with `tensorflow_model_server`. Here is how I export my model:
```
model = tf.keras.models.load_model("model.hdf5")

def __decode_images(images, nch):
    o = ...
```
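For readers trying the same setup, here is a sketch of the export side; the directory names and the `export_path` helper are my own illustration, and the TensorFlow calls are commented since they require the model file from the question:

```python
import os

def export_path(base_dir, version=1):
    """tensorflow_model_server expects numbered version subdirectories
    (e.g. export/1/saved_model.pb); this builds that path."""
    return os.path.join(base_dir, str(version))

# With TensorFlow installed, the export would look roughly like:
# import tensorflow as tf
# model = tf.keras.models.load_model("model.hdf5")
# tf.saved_model.save(model, export_path("export"))  # writes export/1/
# Then serve it with:
#   tensorflow_model_server --model_base_path=$PWD/export --rest_api_port=8501
print(export_path("export"))
```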
1 answer · 0 votes · 464 views · asked 6 months ago
I am new to AWS Neuron SDK and the documentation seems confusing to me.
There is no direct guide on how to install the SDK and use it to compile models. The examples are outdated and the installation...
1 answer · 0 votes · 581 views · asked 6 months ago
Currently, we are using Elastic Inference for inferencing on AWS ECS. We use `inference_accelerators` in `ecs.Ec2TaskDefinition` to set up elastic inference. For scaling, we are monitoring...
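To make the setup concrete, a sketch of one accelerator entry as it would appear in a CDK task definition; the device name and type are hypothetical examples, and the CDK construct usage is commented since it only runs inside a CDK app:

```python
def inference_accelerator(device_name="device_1", device_type="eia2.medium"):
    """One Elastic Inference accelerator entry for an ECS task definition
    (device name and type here are made-up examples)."""
    return {"device_name": device_name, "device_type": device_type}

# With the AWS CDK (aws_cdk.aws_ecs as ecs), the task definition would be
# declared roughly like:
# task_def = ecs.Ec2TaskDefinition(
#     self, "TaskDef",
#     inference_accelerators=[ecs.InferenceAccelerator(**inference_accelerator())],
# )
print(inference_accelerator())
```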
1 answer · 0 votes · 520 views · asked 6 months ago
I have a project where I would like to send inference requests. For this I need an API, such as AWS Lambda or a SageMaker endpoint, so that the customer can send their requests there.
The inference performed...
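As a sketch of the SageMaker-endpoint route: the payload shape and endpoint name below are hypothetical (the real body format depends on how the deployed model server parses requests), and the boto3 call is commented since it needs AWS credentials and a live endpoint:

```python
import json

def build_payload(inputs):
    """JSON body for an inference request (hypothetical shape)."""
    return json.dumps({"inputs": inputs})

# With boto3 configured, a SageMaker endpoint could be invoked like:
# import boto3
# runtime = boto3.client("sagemaker-runtime")
# resp = runtime.invoke_endpoint(
#     EndpointName="my-endpoint",          # hypothetical endpoint name
#     ContentType="application/json",
#     Body=build_payload("example input"),
# )
# result = json.loads(resp["Body"].read())
print(build_payload("example input"))
```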
1 answer · 0 votes · 677 views · asked 7 months ago
Hello, I'm using the **pytorch-inference:2.0.1-gpu-py310-cu118-ubuntu20.04-sagemaker** AMI and following this guide...
1 answer · 0 votes · 495 views · asked 7 months ago
Hello,
I'm using the **pytorch-inference:2.0.1-gpu-py310-cu118-ubuntu20.04-sagemaker** image to run the service, and I'd like to switch to an Inf2 instance.
~~I think I can try to use...
1 answer · 0 votes · 547 views · asked 8 months ago
TensorFlow instance
I'm considering launching an instance to work on one of my TensorFlow models, since my current PC doesn't perform efficiently. My PC has 32 GB of RAM, a 20-core i7 processor, and an RTX 3050Ti 20GB GPU. I...
1 answer · 0 votes · 526 views · asked 8 months ago
Hello,
I am using an Auto Scaling group with Inferentia chips, but I encounter some problems during deployment. There are three Availability Zones in my ASG, which means that those zones must contain...
0 answers · 0 votes · 395 views · asked 8 months ago
As the title says, can we host LLMs and Stable Diffusion models from JumpStart directly on SageMaker Inf1 or Inf2 instances?
> I tried doing that with the Stable Diffusion 2 model (i.e. from Studio...
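For reference, a sketch of deploying a JumpStart model onto an Inf2 instance with the SageMaker Python SDK; the model id is a hypothetical placeholder, the instance type is one example, and whether a given JumpStart model supports Inf1/Inf2 varies by model, so the deploy call is shown commented:

```python
def deploy_kwargs(instance_type="ml.inf2.xlarge"):
    """Arguments for a JumpStart deploy call (example values only)."""
    return {"instance_type": instance_type, "initial_instance_count": 1}

# With the SageMaker Python SDK:
# from sagemaker.jumpstart.model import JumpStartModel
# model = JumpStartModel(model_id="model-id-from-jumpstart")  # hypothetical id
# predictor = model.deploy(**deploy_kwargs())
print(deploy_kwargs())
```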
2 answers · 0 votes · 728 views · asked 10 months ago
We are facing issues while using this model on the aforementioned machine. We were able to run the same experiment successfully on a G5 instance, but we are observing that the same code is not working on...
1 answer · 0 votes · 547 views · asked a year ago