How to implement LLM AI chatbot using vLLM in AWS with EKS

0

Hi guys,

I need some help and was wondering if anyone have some guideline steps or a guide for me to implement the Mixtral-8x7B-Instruct-v0.1 model (the LLM for my AI chatbot) using this GPU optimized library called vLLM (https://github.com/vllm-project/vllm) in AWS using EKS (Elastic Kubernetes Service).

This is urgent. Would really really appreciate any help with this. Many thanks.

Ps: if you have some sort of rough guidelines without the vLLM part, that would work too.

Sem respostas

Você não está conectado. Fazer login para postar uma resposta.

Uma boa resposta responde claramente à pergunta, dá feedback construtivo e incentiva o crescimento profissional de quem perguntou.

Diretrizes para responder a perguntas