AWS Bedrock Nova Pro concurrency limit

Hi team, I would like to ask about the concurrency limit for the Bedrock Nova Pro model. I have already enabled cross-Region inference in Ohio, and AWS has increased my requests per minute (RPM) quota to 2500.

Asked 2 months ago · Viewed 124 times
2 Answers

Amazon Bedrock enforces concurrency limits on model inference requests, which govern how many requests can be processed simultaneously. The sources below do not state the exact concurrent request limit for Nova Pro, however.

These concurrency limits are separate from the requests per minute (RPM) quota, which you mentioned has been increased to 2500 in Ohio.

When dealing with high concurrent traffic for Nova Pro, you should be aware that:

  1. The default quota for Nova Pro in some regions (like Frankfurt) is only 50 requests per minute (RPM).

  2. The on-demand model inference requests per minute quota for Nova Pro is typically marked as "not adjustable" in the standard Service Quotas console, though you've already received a custom increase.

  3. Cross-region inference (which you've enabled) helps distribute traffic across multiple AWS Regions, increasing your overall throughput beyond single-region limits.
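To see how an RPM quota relates to concurrency, a rough estimate can be made with Little's law (average requests in flight = arrival rate × average latency). The sketch below is only an illustration: the 2500 RPM figure comes from the question, while the 3-second average latency is an assumed example value, and the result is an estimate of in-flight requests at a steady rate, not an enforced Bedrock limit.

```python
def in_flight_requests(rpm: float, avg_latency_seconds: float) -> float:
    """Estimate the average number of requests in flight at a steady rate.

    Little's law: in flight = (requests per second) * (seconds per request).
    """
    return rpm * avg_latency_seconds / 60.0


# 2500 RPM is taken from the question; 3.0 s latency is an assumption.
print(in_flight_requests(2500, 3.0))  # 125.0
```

If measured latency is higher than assumed, the same RPM implies proportionally more simultaneous requests.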

If you're experiencing throttling or need higher concurrency, you might consider:

  • Implementing request queuing mechanisms between your application and the model
  • Optimizing your application architecture, including batching requests where possible
  • Contacting AWS Support directly if you need further increases beyond what's already been granted

AWS Support can also provide guidance tailored to your specific use case and account.
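The request-queuing suggestion above could be sketched as a client-side limiter that caps the number of requests in flight and retries throttled calls with exponential backoff. This is a minimal sketch, not an AWS-provided mechanism: `call_with_limits`, `ThrottledError`, and `fake_model` are hypothetical names, and the default values are arbitrary. In a real application, `call` would wrap your model invocation and raise `ThrottledError` when the service reports throttling.

```python
import asyncio
import random


class ThrottledError(Exception):
    """Hypothetical stand-in for a throttling error from the model endpoint."""


async def call_with_limits(call, payloads, max_in_flight=8, max_retries=5, base_delay=0.5):
    """Run call(payload) for every payload with bounded concurrency and backoff."""
    semaphore = asyncio.Semaphore(max_in_flight)

    async def run_one(payload):
        for attempt in range(max_retries + 1):
            try:
                # At most max_in_flight calls hold the semaphore at once.
                async with semaphore:
                    return await call(payload)
            except ThrottledError:
                if attempt == max_retries:
                    raise
                # Exponential backoff with jitter; the slot is released while waiting.
                await asyncio.sleep(random.uniform(0, base_delay * 2 ** attempt))

    # gather returns results in the same order as the payloads.
    return await asyncio.gather(*(run_one(p) for p in payloads))


async def fake_model(payload):
    # Hypothetical stand-in for the real model call.
    await asyncio.sleep(0.01)
    return f"echo: {payload}"


results = asyncio.run(call_with_limits(fake_model, ["a", "b", "c"], max_in_flight=2))
print(results)  # ['echo: a', 'echo: b', 'echo: c']
```

Waiting callers queue on the semaphore, so bursts are smoothed on the client side instead of being rejected by the service.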
Sources
AWS Bedrock : Nova Pro with high concurrent traffic | AWS re:Post
Bedrock Quota For Nova Pro | AWS re:Post
Requests to Invoke Agent in Bedrock are being queued | AWS re:Post
How to increase AWS Bedrock quota for the specific model? | AWS re:Post

Answered 2 months ago

To view service quotas for any Amazon Bedrock model, do one of the following:

  1. Follow the steps at Viewing service quotas and select Amazon Bedrock as the service: https://docs.aws.amazon.com/servicequotas/latest/userguide/gs-request-quota.html
  2. Refer to the Amazon Bedrock service quotas in the AWS General Reference: https://docs.aws.amazon.com/general/latest/gr/bedrock.html#limits_bedrock
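The same quotas can also be read programmatically. The following is a minimal sketch that assumes `boto3` is installed, AWS credentials are configured, and `bedrock` is the Service Quotas service code for Amazon Bedrock. The helper names `match_quotas` and `list_bedrock_quotas` are hypothetical, and the keyword filter simply matches on the quota name, so check the names returned in your own account.

```python
def match_quotas(quotas, keyword):
    """Return (name, value, adjustable) for quotas whose name contains keyword."""
    keyword = keyword.lower()
    return [
        (q["QuotaName"], q["Value"], q["Adjustable"])
        for q in quotas
        if keyword in q["QuotaName"].lower()
    ]


def list_bedrock_quotas(region_name, keyword):
    """Fetch the applied Amazon Bedrock quotas for one Region, filtered by name."""
    import boto3  # imported here so match_quotas works without boto3 installed

    client = boto3.client("service-quotas", region_name=region_name)
    paginator = client.get_paginator("list_service_quotas")
    quotas = []
    for page in paginator.paginate(ServiceCode="bedrock"):
        quotas.extend(page["Quotas"])
    return match_quotas(quotas, keyword)


# Example usage (requires credentials; us-east-2 is the Ohio Region):
# for name, value, adjustable in list_bedrock_quotas("us-east-2", "Nova Pro"):
#     print(f"{name}: {value} (adjustable: {adjustable})")
```

Quotas are applied per Region, so run the lookup in each Region you care about.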

To increase a limit, follow the quota increase request process: https://docs.aws.amazon.com/bedrock/latest/userguide/quotas-increase.html

Source: https://docs.aws.amazon.com/bedrock/latest/userguide/quotas.html

AWS
Moderator
Answered 2 months ago
AWS
Expert
Reviewed 2 months ago
