g5g.xlarge in us-west-2 repeatedly hangs without warning

Hello. My G5g (Graviton2 - NVIDIA T4G) instance in the us-west-2 region suddenly hangs, over and over again. Has anybody else experienced this?

It started up fine, and I could use it without problems, but only for a while.

Each time, about one hour after startup, it hung completely and suddenly lost its network connection.

  • Rebooting from the management console did not restore connectivity.
  • After I stopped it from the management console, the instance stayed in the "Stopping" state for more than 5 minutes.
  • Only a force stop from the management console resolved the situation.

I only run VirtualGL and a TigerVNC server. I didn't notice any memory shortage before the hang.
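To back up the memory observation with data, something like the following could be left running on the instance. It is only a minimal sketch, not something I have confirmed to catch the hang: it assumes a Linux guest with /proc/meminfo, and the names `parse_meminfo`, `log_memory`, and the log path `/var/tmp/meminfo.log` are hypothetical. Each sample is flushed to disk so that the last lines should still be readable after a force stop.

```python
import os
import time


def parse_meminfo(text):
    """Parse /proc/meminfo-style text into {field: integer value}.

    Most fields are reported in kB; a few (e.g. HugePages_Total) are counts.
    """
    info = {}
    for line in text.splitlines():
        name, sep, rest = line.partition(":")
        parts = rest.split()
        # Skip lines without a "name: number" shape.
        if sep and parts and parts[0].isdigit():
            info[name.strip()] = int(parts[0])
    return info


def log_memory(log_path="/var/tmp/meminfo.log", interval_s=10, max_samples=None):
    """Append MemAvailable/SwapFree samples to log_path (hypothetical path).

    Runs until max_samples is reached, or forever if max_samples is None.
    """
    count = 0
    while max_samples is None or count < max_samples:
        with open("/proc/meminfo") as f:
            info = parse_meminfo(f.read())
        stamp = time.strftime("%Y-%m-%dT%H:%M:%S")
        with open(log_path, "a") as out:
            out.write(
                f"{stamp} MemAvailable={info.get('MemAvailable')} "
                f"SwapFree={info.get('SwapFree')}\n"
            )
            # Force the sample to disk so it is not lost when the instance hangs.
            out.flush()
            os.fsync(out.fileno())
        count += 1
        time.sleep(interval_s)
```

Calling `log_memory()` in a separate session before starting VirtualGL and TigerVNC, then reading the log after the next force stop, would show whether available memory was actually dropping before the hang.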

Asked 2 years ago, 245 views
3 Answers

I could reproduce exactly the same problem in the ap-northeast-1 (Tokyo) region as well.

Answered 2 years ago

It occurs with both of NVIDIA's official drivers for Linux aarch64 (https://www.nvidia.com/en-us/drivers/unix/):

  • Latest Production Branch Version: 470.94
  • Latest New Feature Branch Version: 495.46

Answered 2 years ago

I made another trial with the AWS-provided AMI ami-0122dba335a03859e, Deep Learning AMI Graviton GPU CUDA 11.4.2 (Ubuntu 20.04) 20211119, without any updates, in us-west-2. It ran for more than 3 hours and looked fine. But after I started using the GPU with VirtualGL + TigerVNC + Firefox to show threejs.org sample pages, it hung. The same symptoms appeared.

Answered 2 years ago
