Elasticache Redis (cluster mode enabled) NXDOMAIN error

0

Hello!

I've started new Elasticache Redis cluster (cluster mode enabled) with 1 shard and 2 nodes in it, cache.t4g.micro, using terraform module by CloudPosse.

After couple of weeks second node lost it's DNS record.

First node is ok:

host zzz-0001-001.zzz.yyy.euw1.cache.amazonaws.com

zzz-0001-001.zzz.yyy.euw1.cache.amazonaws.com has IPv6 address aaaa:aaa:aaaa:aaa:aaaa:aaaa:aaaa:aaaa

But second node returns dns NXDOMAIN error:

host zzz-0001-002.zzz.yyy.euw1.cache.amazonaws.com

zzz-0001-001.zzz.yyy.euw1.cache.amazonaws.com not found: 3(NXDOMAIN)

It started only after a couple of weeks after initial start.

I see this error after connect via teleport (via redis-cli the same error is shown):

tsh db connect --db-user my-user zzz

localhost:65293> keys *
(error) ERR Teleport: dial tcp: lookup zzz-0001-002.zzz.yyy.euw1.cache.amazonaws.com on [fdf3:aaaa:aaaa::a]:53: no such host
(0.76s)
rifler
已提問 2 個月前檢視次數 136 次
2 個答案
1
已接受的答案

Answer from AWS support:

On 2024-03-26 at 18:07:00 UTC, ElastiCache applied patching update elasticache-20240225-arm for your replication group named qqq in yyy region. Due to a bug , ipv6 address for your node qqq-0001-001 was removed from the endpoint.We have since fixed the issue for your cluster and ipv6 address has been updated for your replication group.

rifler
已回答 25 天前
profile picture
專家
已審閱 2 天前
0

This is not the answer you expect, but when cluster mode is enabled, the Configuration endpoint is used, so I don't think you would use the per-node endpoint directly.

https://docs.aws.amazon.com/AmazonElastiCache/latest/red-ug/Endpoints.html

Of course, the per-node endpoint can be named resolved. If the node's endpoint cannot resolve the name, it may be that the node is currently unavailable. Check the node's status in the management console.

profile picture
專家
shibata
已回答 2 個月前
profile picture
專家
已審閱 2 個月前

您尚未登入。 登入 去張貼答案。

一個好的回答可以清楚地回答問題並提供建設性的意見回饋,同時有助於提問者的專業成長。

回答問題指南