Hello. Thanks for reaching out. There are a few options here. If you know when you expect to get sporadically high usage, you can set up time based Application Autoscaling. This would increase your throughput during certain times of the day and then scale it down.
If your work is not predictable by time, you can set up endpoint utilization based Application Autoscaling. This would increase your throughput when your endpoint reaches a certain target utilization. Both these options require you to maintain at least 1 IU of throughput on your endpoint so you will continue to incur that minimum cost.
See here: https://docs.aws.amazon.com/comprehend/latest/dg/comprehend-autoscaling.html
If your workload is not large enough to maintain 1 IU, you could consider programmatically deleting and creating an endpoint after/before your workload is expected. Note that endpoint creation takes a few minutes so you need to start the endpoint creation with enough time before your workload is expected.
Relevant content
- asked 2 months ago
- AWS OFFICIALUpdated 2 months ago
- AWS OFFICIALUpdated 8 months ago
- AWS OFFICIALUpdated 2 years ago