Questions tagged with AWS Inferentia
We are using tensorflow.neuron to compile a TensorFlow 1.x SavedModel to run on AWS Inferentia machines on EC2. We do this by calling:
tensorflow.neuron.saved_model.compile(model_dir,...
3 answers · 0 votes · 128 views · asked a month ago
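The compile call mentioned in the question above can be sketched roughly as follows. This is a minimal sketch, assuming the TF 1.x `tensorflow-neuron` pip package; the paths, the `_neuron` suffix convention, and the batch size are all hypothetical:

```python
# Sketch (hypothetical paths): compiling a TF 1.x SavedModel for Inferentia
# with the tensorflow-neuron package, as in the question above.

def compiled_model_dir(model_dir: str) -> str:
    # Naming convention used here (an assumption, not a Neuron requirement):
    # write the compiled artifacts next to the original SavedModel with a
    # "_neuron" suffix.
    return model_dir.rstrip("/") + "_neuron"

def compile_for_inferentia(model_dir: str, batch_size: int = 1) -> str:
    import tensorflow.neuron as tfn  # requires the tensorflow-neuron 1.x pip package
    out_dir = compiled_model_dir(model_dir)
    # tfn.saved_model.compile reads the SavedModel, moves supported ops onto
    # NeuronCores, and writes a new SavedModel to out_dir.
    tfn.saved_model.compile(model_dir, out_dir, batch_size=batch_size)
    return out_dir
```

The compiled SavedModel can then be served as usual (e.g. with `tensorflow_model_server`) on an Inf1 instance.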
Currently, I host my model with `tensorflow_model_server`. Here is how I export my model:
```
import tensorflow as tf

model = tf.keras.models.load_model("model.hdf5")

def __decode_images(images, nch):
    o = ...
```
1 answer · 0 votes · 174 views · asked 5 months ago
I am new to the AWS Neuron SDK, and the documentation seems confusing to me.
There is no direct guide on how to install the SDK and use it to compile models. The examples are outdated and the installation...
1 answer · 0 votes · 281 views · asked 5 months ago
Currently, we are using Elastic Inference for inference on AWS ECS. We use `inference_accelerators` in `ecs.Ec2TaskDefinition` to set up Elastic Inference. For scaling, we are monitoring...
1 answer · 0 votes · 237 views · asked 6 months ago
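The `inference_accelerators` setup mentioned in the question above can be sketched with the AWS CDK (v2, Python). The construct id, device name, and accelerator size below are hypothetical placeholders:

```python
# Sketch (hypothetical names): attaching an Elastic Inference accelerator
# to an ECS task definition with the AWS CDK, as in the question above.

ACCELERATOR = {"device_name": "device_1", "device_type": "eia2.medium"}

def make_task_definition(scope):
    from aws_cdk import aws_ecs as ecs  # requires aws-cdk-lib

    # inference_accelerators on Ec2TaskDefinition declares the Elastic
    # Inference device; containers then reference it by device_name.
    return ecs.Ec2TaskDefinition(
        scope, "InferenceTask",
        inference_accelerators=[
            ecs.InferenceAccelerator(
                device_name=ACCELERATOR["device_name"],
                device_type=ACCELERATOR["device_type"],
            )
        ],
    )
```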
I have a project where I would like to send inference requests. For this I need an API, such as AWS Lambda or a SageMaker endpoint, so that the customer can send their requests there.
The inference performed...
1 answer · 0 votes · 391 views · asked 6 months ago
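The Lambda-to-SageMaker pattern in the question above can be sketched with boto3. The endpoint name and the JSON payload shape are assumptions about the serving container, not a fixed SageMaker requirement:

```python
import json

# Sketch (hypothetical endpoint name and payload shape): forwarding an
# inference request to a SageMaker endpoint, e.g. from a Lambda handler.

def build_payload(instances):
    # SageMaker passes the request body to the model container as-is;
    # this {"instances": ...} shape is an assumption about the container.
    return json.dumps({"instances": instances})

def invoke(endpoint_name, instances):
    import boto3  # available by default in the Lambda Python runtime
    client = boto3.client("sagemaker-runtime")
    response = client.invoke_endpoint(
        EndpointName=endpoint_name,
        ContentType="application/json",
        Body=build_payload(instances),
    )
    # response["Body"] is a streaming body; read and decode the JSON reply.
    return json.loads(response["Body"].read())
```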
Hello, I'm using the **pytorch-inference:2.0.1-gpu-py310-cu118-ubuntu20.04-sagemaker** AMI and following this guide...
1 answer · 0 votes · 217 views · asked 6 months ago
Hello,
I'm using the **pytorch-inference:2.0.1-gpu-py310-cu118-ubuntu20.04-sagemaker** image to run the service, and I'd like to switch to an INF2 instance.
~~I think I can try to use...
1 answer · 0 votes · 256 views · asked 7 months ago
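Moving from the GPU image to Inf2, as the question above asks, generally means ahead-of-time compiling the model with `torch-neuronx` instead of relying on CUDA. A minimal sketch, assuming the Neuron SDK's `torch-neuronx` package; the model and example input are placeholders:

```python
# Sketch: tracing a PyTorch model for Inf2 with torch-neuronx, replacing
# the CUDA path used by the pytorch-inference GPU image. Model and example
# input here are hypothetical.

def trace_for_inf2(model, example_input):
    import torch_neuronx  # from the AWS Neuron SDK (Inf2/Trn1 instances)

    model.eval()
    # torch_neuronx.trace compiles the model ahead of time for NeuronCores
    # and returns a TorchScript-like module runnable on an inf2 instance.
    return torch_neuronx.trace(model, example_input)
```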
TensorFlow instance
I'm considering launching an instance to work on one of my TensorFlow models, since my current PC doesn't perform efficiently. My PC has 32GB of RAM, a 20-core i7 processor, and an RTX 3050Ti 20GB GPU. I...
1 answer · 0 votes · 245 views · asked 7 months ago
Hello,
I am using an Auto Scaling group with Inferentia chips, but I encounter some problems during deployment. There are three Availability Zones in my ASG, which means that those zones must contain...
0 answers · 0 votes · 129 views · asked 7 months ago
As the title says, can we host LLMs and Stable Diffusion models from JumpStart directly on SageMaker Inf1 or Inf2 chips?
> I tried doing that with the Stable Diffusion 2 model (i.e. from Studio...
1 answer · 0 votes · 432 views · asked 9 months ago
We are facing issues while using this model on the aforementioned machine. We were able to run the same experiment on a G5 instance successfully, but we are observing that the same code is not working on...
1 answer · 0 votes · 271 views · asked 10 months ago
**Can someone help me load my model to create an endpoint?**
I have provided an explanation of the steps followed, the error logs, and the code used to create everything... thank you in advance.
I'm trying very hard to...
2 answers · 0 votes · 508 views · asked a year ago