us-west-2 g5g.xlarge repetitive sudden hang up

0

Hello. My G5g (Graviton2 - NVIDIA T4G) instance in us-west-2 region suddenly hangs up over and over again. Does anybody experience this?

It stated up fine. I could use it without problem, only for a while.

But over and over again, after around one hour from startup, it hanged up at all. It suddenly lost network connection.

  • Rebooting from management console did not resolve connectivity.
  • After stopping from management console, the instance was kept "Stopping" for more than 5 minutes or longer.
  • Force stop from management console only solve that situation.

I only use VirtualGL and TigerVNC server. I didn't notice any memory shortage before the hanging up.

질문됨 2년 전245회 조회
3개 답변
0

I could reproduce exact the same problem also in ap-northeast-1 (Tokyo) region.

답변함 2년 전
0

It occurs with the both of NVIDIA official driver for aarch64. https://www.nvidia.com/en-us/drivers/unix/ Linux aarch64 Latest Production Branch Version: 470.94 Latest New Feature Branch Version: 495.46

답변함 2년 전
0

I made another trial with AWS provided AMI ami-0122dba335a03859e, Deep Learning AMI Graviton GPU CUDA 11.4.2 (Ubuntu 20.04) 20211119, without any update, in us-west-2. It could be running for more than 3 hours, looked fine. But after I started to use the GPU with VirtualGL + TigerVNC + Firefox to show threejs.org sample pages, it hanged up. The same symptom arose.

답변함 2년 전

로그인하지 않았습니다. 로그인해야 답변을 게시할 수 있습니다.

좋은 답변은 질문에 명확하게 답하고 건설적인 피드백을 제공하며 질문자의 전문적인 성장을 장려합니다.

질문 답변하기에 대한 가이드라인

관련 콘텐츠