1 Antwort
- Neueste
- Die meisten Stimmen
- Die meisten Kommentare
0
Hi Ajay,
Inf1 is inference only and not intended to allow gradient calculation. You should look at the use of torch XLA (using torch_neuronx) on inf2 or trn1 for this, this environment is intended for training tasks (as well as inference) and gradients are calculated there. This is also likely better suited to your embedding output too.
You may want to look at https://awsdocs-neuron.readthedocs-hosted.com/en/latest/frameworks/torch/torch-neuronx/tutorials/training/mlp.html, but instead load the bert model of interest, using the XLA device.
beantwortet vor 10 Monaten
Relevanter Inhalt
- AWS OFFICIALAktualisiert vor 3 Jahren
- AWS OFFICIALAktualisiert vor einem Jahr