How to implement an LLM AI chatbot using vLLM on AWS with EKS


Hi guys,

I need some help and was wondering if anyone has step-by-step guidelines or a guide for deploying the Mixtral-8x7B-Instruct-v0.1 model (the LLM for my AI chatbot) with vLLM, a GPU-optimized inference library (https://github.com/vllm-project/vllm), on AWS using EKS (Elastic Kubernetes Service).

This is urgent, and I would really appreciate any help. Many thanks.

PS: Rough guidelines that leave out the vLLM part would work too.

Leo
Asked 3 months ago, 72 views
No answers
