How to compute gradient using BERt on inferentia

0

Hi All,

I have to compute gradient on BERT model on inferentia. For this I guess I also need access to the hidden layers. Im currently not able to proceed because of not finding literature on the net for inferentia. There seems to be some non inferentia links but those do not work for this case.

Amongst the things which are missing are how to get positional embeddings. Please let me know of any code snippets that may have the content Im looking for.

Thanks in advance Ajay

gefragt vor einem Jahr179 Aufrufe
1 Antwort
0

Hi Ajay,

Inf1 is inference only and not intended to allow gradient calculation. You should look at the use of torch XLA (using torch_neuronx) on inf2 or trn1 for this, this environment is intended for training tasks (as well as inference) and gradients are calculated there. This is also likely better suited to your embedding output too.

You may want to look at https://awsdocs-neuron.readthedocs-hosted.com/en/latest/frameworks/torch/torch-neuronx/tutorials/training/mlp.html, but instead load the bert model of interest, using the XLA device.

AWS
beantwortet vor 10 Monaten

Du bist nicht angemeldet. Anmelden um eine Antwort zu veröffentlichen.

Eine gute Antwort beantwortet die Frage klar, gibt konstruktives Feedback und fördert die berufliche Weiterentwicklung des Fragenstellers.

Richtlinien für die Beantwortung von Fragen