Low GPU RAM VM options and pricing


Hi,

I am trying to get pricing for some of AWS's low-GPU-RAM VM options (e.g., 4 GB or 8 GB of GPU RAM). The pricing options I've seen so far either list only CPU specs or start at 24 GB of GPU RAM and above.

What are the names of some low-GPU-RAM VMs on AWS I should consider? And what is their associated pricing?

Thank you.

Aaron
Asked 2 months ago · Viewed 617 times
2 Answers

Thanks for your response. The use case is deep learning model inference. Our models only use 1 GB to 4 GB of GPU RAM.
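For context, here is the back-of-the-envelope math behind that utilization range (the model size and dtypes below are illustrative assumptions; real usage also includes activations and framework overhead, so treat this as a lower bound):

```python
# Rough weight-memory estimate: parameter count x bytes per parameter.
# The 2B-parameter model size is an illustrative assumption, not one of
# our actual models.
def weight_mem_gib(n_params: int, bytes_per_param: int) -> float:
    """Memory needed just for the model weights, in GiB."""
    return n_params * bytes_per_param / 2**30

two_b = 2_000_000_000
fp16 = weight_mem_gib(two_b, 2)  # ~3.73 GiB -- fits a 4 GiB budget
int8 = weight_mem_gib(two_b, 1)  # ~1.86 GiB after 8-bit quantization
print(round(fp16, 2), round(int8, 2))
```

This is why quantization (2 bytes per parameter down to 1) roughly halves the GPU RAM we need.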

The AWS calculator has no way to filter by GPU RAM, so I've been looking at: https://instances.vantage.sh/?cost_duration=annually&reserved_term=yrTerm1Standard.allUpfront&selected=a1.2xlarge,g4ad.2xlarge,g4dn.8xlarge,g4dn.4xlarge
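Since the calculator can't filter on GPU RAM, a tiny script over exported spec data can. A sketch of what I mean (the GPU memory figures below are an illustrative sample I typed in, not authoritative specs):

```python
# Filter instance types by minimum GPU memory -- something the AWS
# pricing calculator does not support. Sample data only; verify the
# actual GPU memory figures before relying on them.
instances = [
    {"name": "g4ad.xlarge", "gpu_mem_gib": 8},
    {"name": "g4dn.xlarge", "gpu_mem_gib": 16},
    {"name": "g5g.xlarge",  "gpu_mem_gib": 16},
    {"name": "p3.2xlarge",  "gpu_mem_gib": 16},
]

def at_least(gpu_mem_gib: int, table=instances):
    """Names of instance types meeting a GPU memory minimum, smallest first."""
    fits = [i for i in table if i["gpu_mem_gib"] >= gpu_mem_gib]
    return [i["name"] for i in sorted(fits, key=lambda i: i["gpu_mem_gib"])]

print(at_least(4))  # g4ad.xlarge comes first -- the smallest GPU that fits
```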

g4ad.xlarge, with 8 GiB of GPU RAM, has an annual On-Demand cost of $3315.9228, which appears to be the cheapest GPU VM option AWS provides, from my review.

Is this correct, or is there a cheaper option? I think that by using quantization or smaller models we can get GPU RAM utilization below 4 GB. Are there no machines available at a cheaper price point, given that we only use up to 4 GB of GPU RAM?
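For what it's worth, the annual figure above is just an hourly rate multiplied out over a year; the hourly rate below is back-derived from the quoted total ($3315.9228 / 8760 h), so treat it as an assumption rather than an official AWS price:

```python
# Annualizing an hourly On-Demand rate.
# NOTE: the hourly rate is inferred from the quoted annual total, not
# taken from an official AWS price list.
HOURS_PER_YEAR = 24 * 365  # 8760

def annual_cost(hourly_rate: float, hours: int = HOURS_PER_YEAR) -> float:
    """Cost of running an instance continuously for the given hours."""
    return hourly_rate * hours

g4ad_xlarge_hourly = 0.37853  # USD/hour (assumed, see note above)
print(round(annual_cost(g4ad_xlarge_hourly), 4))  # 3315.9228
```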

Thank you,

Aaron

Aaron
Answered 2 months ago
  • Thanks for the details. I have updated my post with other options. SageMaker Serverless Inference may be suitable for your needs.


Can you share the use case you have in mind? Is this for gaming, video editing or ML inference?

You can refer to the GPU instances documentation for an overview of EC2 instances with GPUs. For pricing, you can check the On-Demand pricing page or use the AWS Pricing Calculator. There are also different pricing options, such as Savings Plans.

Some of the cost-effective instance types include g5g, g4dn, and g4ad. They start at 4 vCPUs and 16 GB of RAM.

EDIT: You can go to the EC2 console, open Instance Types, and filter for EC2 instances with GPUs (see screenshot below).

Note that g4ad instances use AMD GPUs. The g5g is on the ARM64 architecture, comes with an NVIDIA T4G GPU, and has 8 GB of RAM.

Do you need to run your inference 24/7? If not, On-Demand may be more cost-effective; stop and start the instance as required.
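To illustrate the stop/start point, a rough sketch of the savings from running only part of the day (the hourly rate here is an illustrative assumption, not an official AWS price):

```python
# On-Demand cost scales with hours actually run, so stopping the
# instance outside working hours cuts the bill proportionally.
# The rate below is an assumed round number for illustration.
def monthly_cost(hourly_rate: float, hours_per_day: float, days: int = 30) -> float:
    """Monthly cost for an instance run hours_per_day each day."""
    return hourly_rate * hours_per_day * days

rate = 0.38  # USD/hour, assumed
always_on = monthly_cost(rate, 24)       # running 24/7
business_hours = monthly_cost(rate, 8)   # running 8 hours/day
print(round(always_on, 2), round(business_hours, 2))
```

Running 8 hours a day instead of 24 cuts the On-Demand cost to a third, whatever the actual rate is.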

You may want to consider SageMaker, especially Serverless Inference. Refer to its pricing page for the pricing model.

AWS
EXPERT
Mike_L
Answered 2 months ago
EXPERT
Reviewed 2 months ago
