- Newest
- Most votes
- Most comments
Given the volume of calls and the architecture that you're implementing I would strongly recommend reaching out to your local AWS Solutions Architect who can help here. It's highly likely you're bumping into some soft limits in the Lamdba service which are designed to protect you from runaway processes but in this case they are not letting you scale the way you would like.
I've seen a lot higher Lambda invocation rates than this with other customers but it did require some work to get there in order to avoid limits on legacy back end systems that the Lambda functions used.
Lambda has two relevant limits: Concurrency limit, which is the max number of functions that can run at once, and Burst limit, which is how fast you can scale to the first limit. By default the concurrency limit is 1000 per account (can be easily increased), and the second limit is 3000/1000/500 burst depending on the region, with additional 500 per minute, this one is more complex to increase.
To overcome these limits I would recommend to move to an asynchronous architecture, in which you send all paragraphs to an SQS queue and let Lambda process the queue. The way Lambda scales when consuming from queue will resolve all scaling issue you are currently facing.
Also, as @Brettski-AWS recommended, talk to an AWS SA.
Relevant content
- asked a year ago
- AWS OFFICIALUpdated a year ago
- AWS OFFICIALUpdated a year ago
- AWS OFFICIALUpdated 3 years ago
- AWS OFFICIALUpdated a year ago