Salta al contenuto

API limits and cost for Q business chatSync API

0

We are planning on building a chatbot with the help of Q Business using custom UI. This chatbot will interact with Q business via chatSync API. On server side the API call is authenticated by using a session aware id token returned by Cognito. We wanted the following information -

  1. Since Q business usage is charged based on number of users, will each session aware temporary credential count as an individual user? We are using the same Identity Center application to exchange tokens
  2. What is the throttling limit for the chatSync API calls?
1 Risposta
2
Risposta accettata

Hi,

To your question-1, since the Subscription tiers are assigned to users through the Amazon Q Business console and the chatSync API calls are connected to this specific userID i.e. they are augmented with an IAM Identity Center token. The billing should be as per the subscriptions added, as defined in Amazon Q Business pricing.

To your question-2, I would suggest reaching out to AWS support to understand the specific throttling limits for the chatSync API. As per Service quotas for Amazon Q Business, this limit is 5 Maximum number of queries per second (QPS) per index if you use the UI and this limit is adjustable via Support. So, best to reach out to them for further clarity.

Hope this is helpful!

Thanks, Rama

AWS
ESPERTO
con risposta 2 anni fa
ESPERTO
verificato 2 anni fa

Accesso non effettuato. Accedi per postare una risposta.

Una buona risposta soddisfa chiaramente la domanda, fornisce un feedback costruttivo e incoraggia la crescita professionale del richiedente.