Unanswered Questions tagged with Amazon SageMaker Model Training
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
what resources are required to train a GPT model with 10 billion parameters using 6 petabytes of data, assuming no hyperparameter tuning is performed? Specifically, how many GPUs would be needed and...
0
answers
0
votes
97
views
asked a month agolg...
The detailed StackOverflow question can be found in this [link](https://stackoverflow.com/questions/76821347/sagemaker-experiment-tracking-duplication/78142919#78142919)
I would like to initialize...
0
answers
0
votes
113
views
asked a month agolg...
do u know how to edit the metaparameters.json file before running an AutoML job. I can see it come out after an AutoML job is ran in the output s3 bucket. But how do I edit and run it again.
Or is...
0
answers
0
votes
95
views
asked 3 months agolg...
Hello, I am using the Sagemaker Jumpstart Flan-T5 XL model to create a LLM Chatbot. I have completed the step by step and additional advanced examples to deploy an untrained model. Now I would like...
0
answers
0
votes
1355
views
asked 5 months agolg...
I get an error when trying to create a model training job in SageMaker. Please see the error message attached![SageMaker error](/media/postImages/original/IMn0tI1aFKQmK0TMELnvO-ug). The error...
0
answers
0
votes
112
views
asked 6 months agolg...
Has any one ever tried to fine tune Whisper model with audio files with AWS ?
0
answers
0
votes
54
views
asked 7 months agolg...
I have run docker image with cmd
docker run -d -p 8501:8500 tensorflow/serving
And also for selenium/hub
docker run -d -p 4446:4444 selenium/hub
where PORT 4446 for selenium/hub is enabled means...
0
answers
0
votes
181
views
asked 7 months agolg...
Hello fiends,
I tried to create a training job with an input data that stored in FSX lustre file system in eu-east-1, while the sagemaker training job in US-east-1. However it is always give me an...
0
answers
0
votes
90
views
asked 7 months agolg...
What should be the boolean value for Reinitialize top layer and Train only the top layer in the finetuning process for SSD MobileNet 1.0 model? Is the SSD model training required only on PNG images?
0
answers
0
votes
79
views
asked 8 months agolg...
I am trying to set up a data quality job for a batch transfer model. All inputs look right to me but the job does not kick off and displays above error when job is...
0
answers
0
votes
96
views
asked 8 months agolg...
I used the training script from [https://sagemaker.readthedocs.io/en/stable/frameworks/xgboost/using_xgboost.html](here),and trying to train the model. And the here is the code I used for configuring...
0
answers
0
votes
199
views
asked a year agolg...
Im using sagemaker for train the data
It has pre-trained model
“tensorflow-od1-ssd-resnet50-v1-fpn-640x640-coco17-tpu-8”
**Create the SageMaker model instance. Note that we need to pass Predictor...
0
answers
0
votes
129
views
asked a year agolg...