The error you're encountering suggests there's a mismatch between the region specified in your ARN and the region where you're making the API call. Here are a few things to check and adjust:
- Ensure that the region in your boto3 client matches the region in your model ARN. In your code, you're using an ARN for eu-west-3, so make sure your boto3 client is also configured for eu-west-3.
- For cross-region inference, you need to use an inference profile instead of directly specifying the model ID. The ARN you're using in the modelId parameter appears to be for an inference profile, which is correct. However, make sure this inference profile exists and is correctly set up in your account.
- Verify that the inference profile you're using includes both the source region (where you're making the API call) and the target region (where the model is hosted).
- Double-check that you have the necessary permissions to use cross-region inference and that you've been granted access to the specific model you're trying to use.
- Ensure that batch inference is supported for the specific model and regions you're working with. Some models or regions might have limitations or might not support batch inference.
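The first check in the list (client region versus ARN region) can be automated before you make the API call. Below is a minimal sketch in plain Python. The ARN, account ID, and profile name are hypothetical placeholders, and the helper names `arn_region` and `region_mismatch` are made up for illustration; they are not part of boto3.

```python
from typing import Optional

# Hypothetical placeholder ARN; substitute the one from your own code.
EXAMPLE_ARN = (
    "arn:aws:bedrock:eu-west-3:123456789012:"
    "inference-profile/eu.example-model-v1:0"
)


def arn_region(arn: str) -> str:
    """Return the region field of an ARN.

    ARNs have the form arn:partition:service:region:account-id:resource.
    The resource part may itself contain colons, so split at most 5 times.
    """
    parts = arn.split(":", 5)
    if len(parts) != 6 or parts[0] != "arn":
        raise ValueError(f"not a valid ARN: {arn!r}")
    return parts[3]


def region_mismatch(arn: str, client_region: str) -> Optional[str]:
    """Return a description of the mismatch, or None if the regions agree."""
    region = arn_region(arn)
    if region != client_region:
        return f"ARN is in {region} but the client is configured for {client_region}"
    return None


if __name__ == "__main__":
    # Compare the ARN against the region the client would be created with.
    problem = region_mismatch(EXAMPLE_ARN, "us-east-1")
    print(problem or "regions match")
```

With boto3, the region a client is actually using is available as `client.meta.region_name`, so you could pass that value as `client_region`, or create the client with `region_name=arn_region(your_arn)` so the two cannot drift apart. This only covers the region check; it does not confirm that the inference profile exists or that you have access to it.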
If you've confirmed all these points and are still facing issues, you may need to contact AWS support for further assistance, as there could be a specific configuration or account-level issue that needs to be addressed.
Sources
Bedrock Batch Inference with Claude | AWS re:Post
Bedrock API invocation error - on demand throughput isn's supported | AWS re:Post
Bedrock API with Cross Region Inference is not working | AWS re:Post
