How to implement an LLM AI chatbot using vLLM on AWS with EKS


Hi guys,

I need some help and was wondering if anyone has steps or a guide for deploying the Mixtral-8x7B-Instruct-v0.1 model (the LLM for my AI chatbot) with the GPU-optimized library vLLM (https://github.com/vllm-project/vllm) on AWS using EKS (Elastic Kubernetes Service).

This is urgent. I would really appreciate any help with this. Many thanks.

P.S. If you have rough guidelines that leave out the vLLM part, that would work too.

No Answers
