Sporadic real-time classification - how to do it cost-efficient?

0

Hello,

We want to sporadically classify large documents (up to 40 pages) in real-time. It works well using comprehend custom classification.

The problem now is that you need to keep alive an endpoint all the time for just a couple of requests per day. This is way too expensive.
I am under the impression that the synchronous classification was designed for high workloads only and does not provide a cost-effective way for an infrequent amount of requests.

Are there any cost-effective alternatives besides crafting a self-made algorithm using Sagemaker?

Kind regards
Thomas

질문됨 3년 전291회 조회
2개 답변
0

Hello. Thanks for reaching out. There are a few options here. If you know when you expect to get sporadically high usage, you can set up time based Application Autoscaling. This would increase your throughput during certain times of the day and then scale it down.
If your work is not predictable by time, you can set up endpoint utilization based Application Autoscaling. This would increase your throughput when your endpoint reaches a certain target utilization. Both these options require you to maintain at least 1 IU of throughput on your endpoint so you will continue to incur that minimum cost.
See here: https://docs.aws.amazon.com/comprehend/latest/dg/comprehend-autoscaling.html

If your workload is not large enough to maintain 1 IU, you could consider programmatically deleting and creating an endpoint after/before your workload is expected. Note that endpoint creation takes a few minutes so you need to start the endpoint creation with enough time before your workload is expected.

답변함 3년 전
0

Thank you for clarification

답변함 3년 전

로그인하지 않았습니다. 로그인해야 답변을 게시할 수 있습니다.

좋은 답변은 질문에 명확하게 답하고 건설적인 피드백을 제공하며 질문자의 전문적인 성장을 장려합니다.

질문 답변하기에 대한 가이드라인

관련 콘텐츠