1 Answer
- Newest
- Most votes
- Most comments
1
You will using the regular autoscaling config outlined in the doc here to configure it for the SageMaker Async endpoint. There are no specifics for SageMaker.
First, you define the "aws_appautoscaling_target" with minimum and maximum capacities. Then go ahead and define your "TargetTrackingScaling" in the autoscaling policy
answered 2 years ago
@AWS_Raghu - thanks this is helpful. one follow up questions , in the original link i provided, in the clean up section , it states that we have to deregister the endpoint as a scalable target before deleting it (I have update my question to add clean up sample code ), I am assuming this is also not sagemaker specific, so can this be done via terraform?
Relevant content
- asked a year ago
- AWS OFFICIALUpdated a year ago
- AWS OFFICIALUpdated 3 years ago
- AWS OFFICIALUpdated a month ago
- AWS OFFICIALUpdated a year ago
Have you tried this yet? Did you get an error. This is the right approach.
@AWS-User-0823707 - yes. it works. I still have few more follow up questions regarding this. do you have any experience in this?