Skip to content

New account stuck at 0 TPM/RPM for all Bedrock models despite AWS default showing 5,000,000 TPM — provisioning issue

0

Hi AWS Community, I'm hoping someone can help or an AWS engineer can escalate this. I've been stuck on this for over 2 weeks now. Account situation: New AWS account, region ap-southeast-2 (Sydney), also tested in us-east-1. The Problem: Every single Anthropic model in Amazon Bedrock shows 0 TPM and 0 RPM in both the Bedrock Quotas console and Service Quotas. This includes Claude Haiku 4.5, Claude 3.5 Sonnet V2, and all other Anthropic models. When I check Service Quotas and search for "Haiku 4.5", I can clearly see:

AWS default quota value: 10,000 RPM / 5,000,000 TPM My applied account-level quota: 0 RPM / 0 TPM

This means my account was never seeded with the default quota values at provisioning time. Even a single "Say Hello" prompt in the Bedrock Playground returns: ThrottlingException: Too many tokens per day, please wait before trying again. What I've already tried:

Confirmed model access is enabled (Model Access page now retired; models auto-enable on first invoke) Tested in both ap-southeast-2 and us-east-1, and they have the same issue in both regions Raised two AWS Support cases on 09/03/2026 (Case IDs: *************** and ***************) - both sat Unassigned for 16 days with no response or contact, so I closed them and opened new ones today Submitted quota increase requests via the Service Quotas console Tried the Bedrock Playground directly, the same ThrottlingException

What I need: My account quota is to be seeded to the AWS published defaults:

Cross-region inference requests per minute for Claude Haiku 4.5: 10,000 RPM Cross-region inference tokens per minute for Claude Haiku 4.5: 5,000,000 TPM

I am not asking for anything above the published defaults — just what AWS documentation says new accounts should receive. Has anyone else experienced this? Is there a way to get this escalated beyond Basic Support? Thanks in advance.

*Edit: Removed case ID - Randi S.

3 Answers
2

It sounds like you may be hitting anomaly prevention.

Quotas protect customers and AWS. For example, a Bedrock TPM limit prevents customers from accidentally running up a huge bill. Additionally, AWS

Is this a new account that was recently setup?

AWS
answered 2 months ago
1

Hi there,

Sorry to hear about this.

We've reviewed your support cases and left internal notes expressing your urgency. We're unable to discuss any account-specific info over social media, so keep an eye on your cases for updates. The cases have been assigned and the support agent will respond as soon as possible.

- Reece W.

AWS
MODERATOR
answered 2 months ago
AWS
EXPERT
reviewed 2 months ago
  • All my 5 cases were closed with the same comment :

    Dear Customer,

    We've reviewed your Bedrock quota increase request. At this time, we recommend building more account history before we can process this type of increase. Our quota management system helps ensure a smooth scaling experience while protecting your account from unexpected costs. To better evaluate future requests, we recommend:

    Utilising your existing AWS services more extensively Establishing a consistent usage pattern over at least one billing cycle(s) Once you have built a more comprehensive usage history, we welcome you to submit a new request. For enterprise-scale implementations, consider:

    This is not really correct, as my quotas were set to 0 from the provisioning time. So Unexpected cost would never be an issue. The way it was provisioned is an issue here.

    I reopened those cases 21 days ago with confirmation that there was a bill payment established on this account, so please proceed with fixing my quotas to the default ones as they should be. Just to make it clear, I am not asking for a quota increase just to be provided with the default ones as per AWS

0

I understand your frustration with the quota provisioning issue. While I can't directly escalate your support cases, I can provide some context that might help you troubleshoot and communicate with AWS Support more effectively.

The throttling you're experiencing with the "Too many tokens per day" error, even with a simple "Say Hello" prompt, is likely related to how Amazon Bedrock reserves quota at the start of each request. When you make a request without explicitly setting the max_tokens parameter, Bedrock defaults to the model's maximum output capacity. For Claude models like Sonnet, this can be as high as 64,000 tokens.

This means that even though your actual prompt and expected response are tiny, the system reserves quota based on the maximum possible output. For example, a request with 1,000 input tokens and no max_tokens set would reserve 65,000 tokens (1,000 input + 64,000 default max output) at the request start, even if the actual output is only 100 tokens.

However, your core issue appears to be that your account quotas are showing 0 TPM/0 RPM instead of the published defaults. This is indeed a provisioning issue that requires AWS Support intervention. For your support cases:

  1. Emphasize that this is a quota provisioning issue, not a usage issue
  2. Specify that you need the account-level quotas initialized to AWS default values
  3. Reference the specific Service Quotas you're seeing (0 vs. the 5,000,000 TPM default)
  4. Consider opening a new case specifically categorized under "Account and Billing Support" > "Account" rather than technical support

Since you're on Basic Support and experiencing delays, you might also try reaching out through the AWS re:Post community or consider temporarily upgrading to Developer Support for faster response times on account-level issues.
Sources
TPM & RPM Quota Monitoring Dashboard for Amazon Bedrock | AWS re:Post

answered 2 months ago
AWS
MODERATOR
reviewed 2 months ago
  • The technical explanation regarding token reservation is accurate and helpful for overall understanding. However, it doesn't directly address the initial provisioning failure where the quota is stuck at 0 instead of the 5M default. The advice on support routing is practical and appreciated

You are not logged in. Log in to post an answer.

A good answer clearly answers the question and provides constructive feedback and encourages professional growth in the question asker.