
Unexpected ThrottlingException and Quota Showing as 0 Across All Regions and Models


Hello,

We recently created a new AWS account and have been actively using it for just over a month. Our application relies on the Nova 2 Sonic and Nova Micro models, and everything was functioning as expected under the default quotas.

However, we have suddenly started encountering the following error for all prompts:

ThrottlingException: Too many tokens per day, please wait before trying again.

Upon further investigation, we noticed that across all AWS accounts within our organization, the TPM (tokens per minute) and RPM (requests per minute) quotas for all models and all regions appear to be set to 0.

At the same time, the system still shows the default quota values (e.g., 8,000,000 tokens). When we attempt to request a quota increase for Nova Micro, we receive the following validation error: "Must be a number greater than your current quota value of 8000000."

This seems inconsistent with the observed behavior (i.e., effective quota being 0 and requests being throttled).
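For anyone trying to document the same discrepancy, a minimal sketch along these lines might help. It compares the applied quota values against the AWS defaults using the Service Quotas API through boto3. The function names `find_quota_mismatches` and `fetch_bedrock_quotas` are hypothetical, and it is an assumption that the API reports the same effective value the runtime enforces; if it reports the default while requests are still throttled, that is itself worth passing on to support.

```python
from __future__ import annotations

from typing import Iterable


def find_quota_mismatches(applied: Iterable[dict], defaults: Iterable[dict]) -> list[dict]:
    """Return quotas whose applied value is lower than the AWS default value.

    Both inputs are lists of dicts shaped like the entries in the "Quotas"
    list returned by the Service Quotas API (QuotaCode, QuotaName, Value).
    """
    default_by_code = {q["QuotaCode"]: q for q in defaults}
    mismatches = []
    for quota in applied:
        default = default_by_code.get(quota["QuotaCode"])
        if default is None:
            continue  # no default known for this quota, nothing to compare
        if quota["Value"] < default["Value"]:
            mismatches.append(
                {
                    "QuotaCode": quota["QuotaCode"],
                    "QuotaName": quota.get("QuotaName", ""),
                    "Applied": quota["Value"],
                    "Default": default["Value"],
                }
            )
    return mismatches


def fetch_bedrock_quotas(region: str) -> tuple[list[dict], list[dict]]:
    """Fetch applied and default Bedrock quotas for one region.

    Needs boto3 and AWS credentials; boto3 is imported lazily so the
    comparison helper above can be used without it.
    """
    import boto3

    client = boto3.client("service-quotas", region_name=region)
    applied = [
        q
        for page in client.get_paginator("list_service_quotas").paginate(ServiceCode="bedrock")
        for q in page["Quotas"]
    ]
    defaults = [
        q
        for page in client.get_paginator("list_aws_default_service_quotas").paginate(
            ServiceCode="bedrock"
        )
        for q in page["Quotas"]
    ]
    return applied, defaults
```

Calling `find_quota_mismatches(*fetch_bedrock_quotas("us-east-1"))` in each affected account and region would give a list that can be attached to a support case. The region name here is only an example.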

Could you please help clarify:

  • Why are the effective TPM/RPM quotas showing as 0 across all accounts and regions?
  • Why are we unable to request a quota increase despite encountering throttling errors?
  • What steps should we take to restore normal quota functionality?

Any guidance would be greatly appreciated.

Thank you.

Asked 2 months ago · Viewed 55 times
1 Answer

Hello.

The question linked below is from someone experiencing the same problem.
According to the response there, resolving it appears to require contacting AWS Support.
https://repost.aws/ja/questions/QUf16LkLwNS2yRbKVufdu6FA/new-account-stuck-at-0-tpm-rpm-for-all-bedrock-models-despite-aws-default-showing-5-000-000-tpm-provisioning-issue

Please contact AWS support via the following URL.
When making an inquiry, I recommend selecting a category as shown in the image below.
https://support.console.aws.amazon.com/support/home#/
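When opening the case, it may help to include the exact error details from a failed call. Below is a rough sketch, not an official procedure, that pulls those details out of a botocore-style error response (the dict available as `ClientError.response`). The helper name `summarize_throttle` and the "tokens per day" substring check are assumptions based on the error message quoted in the question.

```python
from __future__ import annotations

from typing import Optional


def summarize_throttle(error_response: dict) -> Optional[dict]:
    """Summarize a throttling error for a support case.

    `error_response` is expected to look like botocore's
    ClientError.response: an "Error" dict with "Code" and "Message",
    plus "ResponseMetadata" carrying "RequestId" and "HTTPStatusCode".
    Returns None when the error is not a ThrottlingException.
    """
    error = error_response.get("Error", {})
    if error.get("Code") != "ThrottlingException":
        return None

    message = error.get("Message", "")
    metadata = error_response.get("ResponseMetadata", {})
    return {
        "code": error["Code"],
        "message": message,
        "request_id": metadata.get("RequestId"),
        "http_status": metadata.get("HTTPStatusCode"),
        # Assumption: the daily token limit is identified by this phrase,
        # as in the message quoted in the question.
        "daily_token_limit": "tokens per day" in message.lower(),
    }
```

In application code this would typically be called inside an `except botocore.exceptions.ClientError as err:` block as `summarize_throttle(err.response)`, and the resulting request ID and message copied into the case description.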

Answered by an EXPERT 2 months ago
Reviewed by an EXPERT 2 months ago
