Questions tagged with Amazon SageMaker Deployment
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
Hi,
I'm encountering an issue with deploying a SageMaker endpoint that was previously working fine. I successfully deployed the Nous Hermes Llama 2 7B model to a g5.2xlarge endpoint about a week ago....
0
answers
0
votes
94
views
asked 10 months agolg...
We are fine-tuning stable diffusion model on a custom dataset and looking for the DreamBooth approach for training on Sagemaker. Is it possible on Sagemaker. if yes then can you give me some links or...
5
answers
0
votes
673
views
asked 10 months agolg...
Trying to get codegen25-7b-multi to launch on Sagemaker and hitting issues trying to launch on 2xlarge, 8xlarge and 12xlarge instances. All are throwing the same errors:
Error #1...
1
answers
0
votes
281
views
asked 10 months agolg...
Running into issues in getting Starcoder to deploy on Sagemaker.
I'm getting the following errors in CloudWatch and even with the instance type: ml.g5.8xlarge
Error 1:
```
Error: ShardCannotStart
...
2
answers
0
votes
399
views
asked a year agolg...
Is it possible (and efficient) to deploy an LLM model serverlessly using Sagemaker? I'm concerned about the performance and costs involved? The ML application doesn't receive a lot of requests.
2
answers
0
votes
2150
views
asked a year agolg...
I have been trying to deploy this HuggingFace model ( https://huggingface.co/bigcode/starcoderplus/tree/main ) to AWS Sagemaker but failed.
The error message from Cloudwatch is
> "No safetensors...
1
answers
0
votes
992
views
asked a year agolg...
I am encountering an issue while trying to calculate the AWS Signature for my requests. I have been following the AWS documentation and various examples, but I keep getting the following error:
"The...
2
answers
0
votes
814
views
asked a year agolg...
What type of instances can support models with 11B parameters ? I need to use this model for inference jobs.
1
answers
0
votes
172
views
asked a year agolg...
I need some clarification related to hyperparameters :
a) Ways to evaluate best hyperparameters result and how those are linked with model
b) Ways to version control parameters for training jobs
Accepted AnswerAmazon SageMaker Deployment
2
answers
0
votes
204
views
asked a year agolg...
I have a machine learning classification model that was trained outside of SageMaker. The model is in Scikit-learn format. To run this model, the preprocessing step requires the binary content of a...
2
answers
0
votes
455
views
asked a year agolg...
I am setting up autoscaling for a realtime inference endpoint in sagemaker. I set up a load test using locust, and by setting relatively high numbers (i.e: 100 users, with 10 user spawned per seconds)...
1
answers
0
votes
761
views
asked a year agolg...
Hi team !
I need to deploy a ton of Machine Learning Models (Timeseries models) and I'm seeking a way that is effective.
In details, the problem is to build a platform capable of serving many time...
1
answers
0
votes
631
views
asked a year agolg...