Unanswered Questions tagged with Amazon SageMaker Model Training
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
Hello, I have started running a command to train a model using Ultralytics YOLOv8.2.4.
Most of the prerequisites should have already been installed. However whenever i run the cell, it will get stuck...
0
answers
0
votes
54
views
asked 3 days agolg...
what resources are required to train a GPT model with 10 billion parameters using 6 petabytes of data, assuming no hyperparameter tuning is performed? Specifically, how many GPUs would be needed and...
0
answers
0
votes
103
views
asked 2 months agolg...
The detailed StackOverflow question can be found in this [link](https://stackoverflow.com/questions/76821347/sagemaker-experiment-tracking-duplication/78142919#78142919)
I would like to initialize...
0
answers
0
votes
117
views
asked 2 months agolg...
do u know how to edit the metaparameters.json file before running an AutoML job. I can see it come out after an AutoML job is ran in the output s3 bucket. But how do I edit and run it again.
Or is...
0
answers
0
votes
98
views
asked 3 months agolg...
Hello, I am using the Sagemaker Jumpstart Flan-T5 XL model to create a LLM Chatbot. I have completed the step by step and additional advanced examples to deploy an untrained model. Now I would like...
0
answers
0
votes
1369
views
asked 6 months agolg...
I get an error when trying to create a model training job in SageMaker. Please see the error message attached![SageMaker error](/media/postImages/original/IMn0tI1aFKQmK0TMELnvO-ug). The error...
0
answers
0
votes
120
views
asked 6 months agolg...
Has any one ever tried to fine tune Whisper model with audio files with AWS ?
0
answers
0
votes
54
views
asked 7 months agolg...
I have run docker image with cmd
docker run -d -p 8501:8500 tensorflow/serving
And also for selenium/hub
docker run -d -p 4446:4444 selenium/hub
where PORT 4446 for selenium/hub is enabled means...
0
answers
0
votes
185
views
asked 7 months agolg...
Hello fiends,
I tried to create a training job with an input data that stored in FSX lustre file system in eu-east-1, while the sagemaker training job in US-east-1. However it is always give me an...
0
answers
0
votes
97
views
asked 7 months agolg...
What should be the boolean value for Reinitialize top layer and Train only the top layer in the finetuning process for SSD MobileNet 1.0 model? Is the SSD model training required only on PNG images?
0
answers
0
votes
88
views
asked 9 months agolg...
I am trying to set up a data quality job for a batch transfer model. All inputs look right to me but the job does not kick off and displays above error when job is...
0
answers
0
votes
101
views
asked 9 months agolg...
I used the training script from [https://sagemaker.readthedocs.io/en/stable/frameworks/xgboost/using_xgboost.html](here),and trying to train the model. And the here is the code I used for configuring...
0
answers
0
votes
204
views
asked a year agolg...