1 回答
- 最新
- 投票最多
- 评论最多
0
Hi Ajay,
Inf1 is inference only and not intended to allow gradient calculation. You should look at the use of torch XLA (using torch_neuronx) on inf2 or trn1 for this, this environment is intended for training tasks (as well as inference) and gradients are calculated there. This is also likely better suited to your embedding output too.
You may want to look at https://awsdocs-neuron.readthedocs-hosted.com/en/latest/frameworks/torch/torch-neuronx/tutorials/training/mlp.html, but instead load the bert model of interest, using the XLA device.
已回答 10 个月前
相关内容
- AWS 官方已更新 2 年前