Serverless LLMs


I have a couple of questions to ask:

  1. Can I deploy a model serverlessly on SageMaker? My current requirement is to serve some pretrained models with around 8-12 GB of weights.
  2. What is the best approach for deploying and running LLMs on SageMaker if I want to plug in some models from GitHub and manage them there? I might also need fine-tuning along the way.
Asked 3 months ago · 176 views

1 Answer

Hi,

Yes, you can deploy models in serverless mode. See this blog post for the full details: https://aws.amazon.com/blogs/machine-learning/deploying-ml-models-using-sagemaker-serverless-inference-preview/
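To make the mechanics concrete: a serverless endpoint is just an endpoint config whose production variant carries a `ServerlessConfig` instead of instance settings. Below is a minimal sketch using the low-level boto3 `create_endpoint_config`/`create_endpoint` API; the model, config, and endpoint names are hypothetical placeholders, and note that serverless inference currently caps memory at 6144 MB, which is worth checking against your model size before committing to this mode.

```python
def serverless_endpoint_config(config_name, model_name,
                               memory_size_mb=6144, max_concurrency=4):
    """Build a create_endpoint_config request with a serverless variant.

    MemorySizeInMB must be one of 1024, 2048, ..., 6144 (the current
    serverless cap); MaxConcurrency is the number of concurrent invocations.
    """
    return {
        "EndpointConfigName": config_name,
        "ProductionVariants": [{
            "VariantName": "AllTraffic",
            "ModelName": model_name,
            "ServerlessConfig": {
                "MemorySizeInMB": memory_size_mb,
                "MaxConcurrency": max_concurrency,
            },
        }],
    }


def deploy_serverless(config_name, endpoint_name, model_name):
    # Requires AWS credentials and an existing SageMaker Model resource.
    import boto3
    sm = boto3.client("sagemaker")
    sm.create_endpoint_config(
        **serverless_endpoint_config(config_name, model_name))
    sm.create_endpoint(EndpointName=endpoint_name,
                       EndpointConfigName=config_name)
```

The same config can also be built through the SageMaker Python SDK's `ServerlessInferenceConfig`; the dict above is simply what ends up in the underlying API call.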

For your own deployments, see this other blog post: https://aws.amazon.com/blogs/machine-learning/efficiently-train-tune-and-deploy-custom-ensembles-using-amazon-sagemaker/
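For the "plug in models from GitHub and fine-tune" part, the usual pattern is: package your training script or container, launch a SageMaker training job against data in S3, then create a Model from the resulting artifacts. A hedged sketch of the low-level `create_training_job` request (the image URI, role ARN, bucket paths, and instance type are all hypothetical placeholders, not values from the blog post):

```python
def training_job_request(job_name, role_arn, image_uri,
                         s3_train, s3_output,
                         instance_type="ml.g5.2xlarge",
                         hyperparameters=None):
    """Build a create_training_job request for a bring-your-own container."""
    return {
        "TrainingJobName": job_name,
        "RoleArn": role_arn,
        "AlgorithmSpecification": {
            "TrainingImage": image_uri,   # your fine-tuning container in ECR
            "TrainingInputMode": "File",
        },
        # Hyperparameter values must be strings in this API.
        "HyperParameters": hyperparameters or {"epochs": "3", "lr": "2e-5"},
        "InputDataConfig": [{
            "ChannelName": "train",
            "DataSource": {"S3DataSource": {
                "S3DataType": "S3Prefix",
                "S3Uri": s3_train,
                "S3DataDistributionType": "FullyReplicated",
            }},
        }],
        "OutputDataConfig": {"S3OutputPath": s3_output},
        "ResourceConfig": {
            "InstanceType": instance_type,
            "InstanceCount": 1,
            "VolumeSizeInGB": 100,
        },
        "StoppingCondition": {"MaxRuntimeInSeconds": 4 * 3600},
    }


# To launch:
# boto3.client("sagemaker").create_training_job(**training_job_request(...))
```

In practice the SageMaker Python SDK estimators (e.g. the Hugging Face estimator) wrap this same request and are usually more convenient for iterating on fine-tuning runs.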

Best,

Didier

AWS
EXPERT
Answered 3 months ago
