2 Answers
- Newest
- Most votes
- Most comments
1
To add to it, you can set the BatchStrategy to MultiLine
in order to speed up the processing.
General guideline is - number of workers/instances is a multiple of number of files in S3 to process.
If MaxConcurrentTransforms is set to 0 or left unset, Amazon SageMaker checks the optional execution-parameters to determine the settings for your chosen algorithm
answered 2 years ago
0
It partitions the Amazon S3 objects in the input by key. Please checkout https://docs.aws.amazon.com/sagemaker/latest/dg/batch-transform.html
answered 2 years ago
Relevant content
- asked 2 years ago
- how to configure ideal value for MaxConcurrentTransforms in setting up a sagemaker batch transform ?Accepted Answerasked 2 years ago
- AWS OFFICIALUpdated 2 years ago
- AWS OFFICIALUpdated 2 years ago
- AWS OFFICIALUpdated 7 months ago
- AWS OFFICIALUpdated 3 years ago
@AWS-Anonymous - thanks. I am inquiring if there is a way we can figure out a value for MaxConcurrentTransforms when we bring our own model/algorithm/container. can the value equal be to # of cpu available in whatever instance type we configure?