Sagemaker ml.g5.2xlarge instances not working as desired due to nvidia-drivers issue

0

Over the weekend my sagemaker ml.g5.2xlarge started failing with the following errors: -> RuntimeError: No CUDA GPUs are available -> NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.

wipwai
已提問 2 個月前檢視次數 292 次
1 個回答

您尚未登入。 登入 去張貼答案。

一個好的回答可以清楚地回答問題並提供建設性的意見回饋,同時有助於提問者的專業成長。

回答問題指南