Fine-tune Llama2 with DPO (Direct Preference Optimization) on AWS


I'm exploring fine-tuning with DPO and have successfully trained the facebook/opt model (a Hugging Face model) with DPO (ref: https://huggingface.co/blog/dpo-trl). As part of DPO training, I first performed SFT training, then used the final SFT checkpoint as the starting point for DPO training.
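
For context, here is roughly what my DPO step looked like, following the TRL blog post. The checkpoint path and the preference dataset file are placeholders for my actual setup, and the DPOTrainer API may differ in newer trl versions:

```python
# Sketch of the DPO step from the TRL blog post (trl's DPOTrainer).
# Paths and the dataset file are placeholders.
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer, TrainingArguments
from trl import DPOTrainer

sft_checkpoint = "./opt-sft/final_checkpoint"  # output of the SFT stage

# The policy starts from the SFT weights; a second frozen copy
# serves as the reference model for DPO's implicit KL term.
model = AutoModelForCausalLM.from_pretrained(sft_checkpoint)
model_ref = AutoModelForCausalLM.from_pretrained(sft_checkpoint)
tokenizer = AutoTokenizer.from_pretrained(sft_checkpoint)

# DPO expects "prompt", "chosen", and "rejected" columns.
dataset = load_dataset("json", data_files="preference_pairs.json", split="train")

training_args = TrainingArguments(
    output_dir="./opt-dpo",
    per_device_train_batch_size=2,
    learning_rate=5e-7,
    num_train_epochs=1,
    remove_unused_columns=False,  # keep the preference columns for the trainer
)

dpo_trainer = DPOTrainer(
    model,
    model_ref,
    args=training_args,
    beta=0.1,  # weight of the penalty pulling the policy toward the reference
    train_dataset=dataset,
    tokenizer=tokenizer,
)
dpo_trainer.train()
```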

Now I'm working on fine-tuning Llama2 with DPO on AWS. I have successfully fine-tuned Llama2 with SageMaker JumpStart, but I'm stuck on how to perform DPO using the fine-tuned model artifact, which is stored in an S3 bucket.
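
Concretely, I think I need something like the sketch below: a SageMaker training job that receives the JumpStart artifact as an input channel. The bucket path, entry script, and framework versions are placeholders, and dpo_train.py would be my own script that extracts the artifact and builds a DPOTrainer:

```python
# Hypothetical sketch: launching DPO as a SageMaker training job and
# feeding it the JumpStart fine-tuning artifact from S3 as an input channel.
import sagemaker
from sagemaker.huggingface import HuggingFace

role = sagemaker.get_execution_role()

# Placeholder: model.tar.gz produced by the JumpStart fine-tuning job.
sft_model_s3 = "s3://my-bucket/jumpstart-llama2-sft/output/model.tar.gz"

estimator = HuggingFace(
    entry_point="dpo_train.py",   # my DPO script (hypothetical)
    source_dir="./scripts",
    instance_type="ml.g5.12xlarge",
    instance_count=1,
    role=role,
    transformers_version="4.28",
    pytorch_version="2.0",
    py_version="py310",
    hyperparameters={"beta": 0.1, "epochs": 1},
)

# The "model" channel is copied to /opt/ml/input/data/model/ inside the
# container; dpo_train.py has to extract model.tar.gz from there and
# load the weights with AutoModelForCausalLM.from_pretrained(...).
estimator.fit({"model": sft_model_s3})
```

Is this the right pattern, or is there a supported JumpStart/SageMaker path for preference-tuning an already fine-tuned artifact?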

It would be helpful if anyone could share resources or insights on how to proceed with DPO training in AWS. Thanks in advance!

Jyothi
Asked 5 months ago · 1,958 views
No answers
