[BUG] Unable to retrieve attribute for AWS::OpenSearchService::Domain, with error message Internal error occurred, due to throttling

0

Posting https://github.com/aws/aws-cdk/issues/18239 here for increased visibility.

Stacks using attributes of Opensearch domains will randomly fail to deploy. The message shown in Cloudformation is similar to "Unable to retrieve Arn attribute for AWS::OpenSearchService::Domain, with error message Internal error occurred".

The actual cause as per discussion in that issue is that the DescribeDomain calls are being throttled, and Cloudformation is not handling the throttling sensibly.

This is a pernicious bug because of it's unpredictability. In my case, I've deployed my application successfully many times, and have never seen this issue before. I hit this issue whilst upgrading from Elasticsearch 7.10 to Opensearch 1.2; I was making no other changes. I'm stuck unable to rollback cleanly either, as it's not possible to go from Opensearch 1.2 to Elasticsearch 7.10. So I'm having to skip rolling back the cluster, and attempt to roll forward until either it succeeds, or I'm forced to hardcode the domain attributes I should be looking up.

According to the CDK issue, the internal tracking code for this issue is V498467686.

質問済み 2年前108ビュー
回答なし

ログインしていません。 ログイン 回答を投稿する。

優れた回答とは、質問に明確に答え、建設的なフィードバックを提供し、質問者の専門分野におけるスキルの向上を促すものです。

質問に答えるためのガイドライン

関連するコンテンツ