Questions tagged with Amazon SageMaker Deployment
ResourceLimitExceeded: An error occurred (ResourceLimitExceeded) when calling the CreateEndpoint operation: The account-level service limit 'Memory size in MB per serverless endpoint' is 3072 MBs,...
1 answer, 0 votes, 598 views, asked 10 months ago
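The error above reports an account-level quota of 3072 MB for serverless endpoint memory. As a minimal sketch, assuming the request asked for more memory than the quota allows (the truncated message does not show the requested value), the helper below picks the largest serverless memory size that fits the quota. SageMaker serverless inference accepts memory in 1 GB steps from 1024 MB to 6144 MB; the function names and the default `max_concurrency` are hypothetical.

```python
# Memory sizes (in MB) accepted by SageMaker serverless inference.
ALLOWED_MEMORY_MB = (1024, 2048, 3072, 4096, 5120, 6144)


def largest_memory_within_quota(quota_mb):
    """Return the largest allowed memory size that does not exceed the quota."""
    candidates = [m for m in ALLOWED_MEMORY_MB if m <= quota_mb]
    if not candidates:
        raise ValueError(f"quota of {quota_mb} MB is below the 1024 MB minimum")
    return max(candidates)


def serverless_config(quota_mb, max_concurrency=5):
    """Build a ServerlessConfig dict for a production variant.

    max_concurrency=5 is an illustrative value, not a recommendation.
    """
    return {
        "MemorySizeInMB": largest_memory_within_quota(quota_mb),
        "MaxConcurrency": max_concurrency,
    }
```

The resulting dict is intended for the `ServerlessConfig` field of a production variant in `CreateEndpointConfig`. If the model needs more than the quota allows, a quota increase request is the other option.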
Hi,
I'm encountering an issue with deploying a SageMaker endpoint that was previously working fine. I successfully deployed the Nous Hermes Llama 2 7B model to a g5.2xlarge endpoint about a week ago....
0 answers, 0 votes, 95 views, asked 10 months ago
We are fine-tuning a Stable Diffusion model on a custom dataset and want to use the DreamBooth approach for training on SageMaker. Is this possible on SageMaker? If yes, can you give me some links or...
5 answers, 0 votes, 676 views, asked 10 months ago
Trying to get codegen25-7b-multi to launch on SageMaker and hitting issues on 2xlarge, 8xlarge and 12xlarge instances. All throw the same errors:
Error #1...
1 answer, 0 votes, 285 views, asked 10 months ago
Running into issues getting StarCoder to deploy on SageMaker.
I'm getting the following errors in CloudWatch, even with the instance type ml.g5.8xlarge:
Error 1:
```
Error: ShardCannotStart
```
...
2 answers, 0 votes, 403 views, asked a year ago
Is it possible (and efficient) to deploy an LLM serverlessly using SageMaker? I'm concerned about the performance and costs involved. The ML application doesn't receive a lot of requests.
2 answers, 0 votes, 2156 views, asked a year ago
I have been trying to deploy this Hugging Face model ( https://huggingface.co/bigcode/starcoderplus/tree/main ) to AWS SageMaker, but the deployment failed.
The error message from CloudWatch is:
> "No safetensors...
1 answer, 0 votes, 997 views, asked a year ago
I am encountering an issue while trying to calculate the AWS Signature for my requests. I have been following the AWS documentation and various examples, but I keep getting the following error:
"The...
2 answers, 0 votes, 821 views, asked a year ago
What types of instances can support models with 11B parameters? I need to use this model for inference jobs.
1 answer, 0 votes, 173 views, asked a year ago
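For sizing questions like the one above, a back-of-the-envelope estimate of weight memory is a common starting point. The sketch below is only an approximation: it counts model weights alone, and the `overhead` factor for activations and KV cache is an assumed illustrative value, not a measured one. All function names are hypothetical.

```python
# Bytes used per parameter for common inference precisions.
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "bf16": 2, "int8": 1}


def weight_memory_gb(num_params, dtype="fp16"):
    """Memory for the weights alone, in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


def estimated_serving_memory_gb(num_params, dtype="fp16", overhead=1.2):
    """Weights scaled by an assumed overhead factor (1.2 is illustrative only)."""
    return weight_memory_gb(num_params, dtype) * overhead


if __name__ == "__main__":
    for dtype in ("fp32", "fp16", "int8"):
        print(dtype, weight_memory_gb(11e9, dtype), "GB for weights")
```

For 11B parameters this gives about 44 GB in fp32, 22 GB in fp16, and 11 GB in int8 for the weights alone. In fp16 that is already close to the 24 GB of a single A10G GPU (the GPU in the g5 family), so in practice either a multi-GPU instance or quantization would likely be needed.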
I need some clarification related to hyperparameters:
a) How to evaluate which hyperparameters give the best result, and how they are linked to the model
b) How to version-control parameters for training jobs
Accepted Answer
Amazon SageMaker Deployment
2 answers, 0 votes, 205 views, asked a year ago
I have a machine learning classification model that was trained outside of SageMaker. The model is in Scikit-learn format. To run this model, the preprocessing step requires the binary content of a...
2 answers, 0 votes, 458 views, asked a year ago
I am setting up autoscaling for a real-time inference endpoint in SageMaker. I set up a load test using Locust, and by setting relatively high numbers (i.e., 100 users, with 10 users spawned per second)...
1 answer, 0 votes, 773 views, asked a year ago
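For context on the autoscaling setup the question describes, here is a minimal sketch of the two request payloads that Application Auto Scaling expects for a SageMaker endpoint variant. It only builds the dictionaries and makes no AWS calls. The endpoint name, variant name, capacities, target value, and cooldowns are all hypothetical placeholders, and the truncated question does not say which metric or values the author used.

```python
def autoscaling_requests(endpoint_name, variant_name,
                         min_capacity=1, max_capacity=4,
                         invocations_per_instance=70.0,
                         scale_in_cooldown=300, scale_out_cooldown=60):
    """Build kwargs for register_scalable_target and put_scaling_policy.

    All default values are illustrative assumptions, not recommendations.
    """
    common = {
        "ServiceNamespace": "sagemaker",
        "ResourceId": f"endpoint/{endpoint_name}/variant/{variant_name}",
        "ScalableDimension": "sagemaker:variant:DesiredInstanceCount",
    }
    # Declares the variant's instance count as scalable within bounds.
    target = dict(common, MinCapacity=min_capacity, MaxCapacity=max_capacity)
    # Target tracking on average invocations per instance.
    policy = dict(
        common,
        PolicyName=f"{endpoint_name}-invocations-target",  # hypothetical name
        PolicyType="TargetTrackingScaling",
        TargetTrackingScalingPolicyConfiguration={
            "TargetValue": invocations_per_instance,
            "PredefinedMetricSpecification": {
                "PredefinedMetricType": "SageMakerVariantInvocationsPerInstance",
            },
            "ScaleInCooldown": scale_in_cooldown,
            "ScaleOutCooldown": scale_out_cooldown,
        },
    )
    return target, policy
```

With boto3, the two dicts would be passed as `client.register_scalable_target(**target)` and `client.put_scaling_policy(**policy)` on an `application-autoscaling` client. Because scale-out takes time to take effect, a load test that ramps up within seconds can overwhelm the endpoint before new instances are in service.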