How do I pass inference parameters to a Hugging Face model hosted in SageMaker?


I created a model resource in SageMaker. The model is a tar file, downloaded from Hugging Face and fine-tuned. The documentation's sample code (below) passes the HF_TASK inference parameter, which I assume is Hugging Face specific. Is it possible to pass other parameters, such as padding: True, truncation: True, and max_length = 512?

How do I pass these values?

import sagemaker
from sagemaker.huggingface import HuggingFaceModel

hub = {
    'HF_TASK': 'text2text-generation'
}
role = sagemaker.get_execution_role()

huggingface_model = HuggingFaceModel(
    transformers_version='4.6.1',
    env=hub,
    role=role,
    ...
)

predictor = huggingface_model.deploy(...)
  • If you are using a pretrained model, you may not be able to tweak parameters such as padding. I am not sure why you would want to do that at inference time.
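For context on the question above: the SageMaker Hugging Face Inference Toolkit accepts per-request options in the request body under a "parameters" key, which it forwards to the underlying transformers pipeline call. A minimal sketch of such a payload, using the values from the question (whether the pipeline honors each key, e.g. padding vs. max_length, depends on the task and transformers version, so treat this as an assumption to verify against your endpoint):

```python
# Sketch of a request payload for a Hugging Face SageMaker endpoint.
# Everything under "parameters" is forwarded to the pipeline, so
# tokenization/generation options from the question go there.
payload = {
    "inputs": "An example input sentence for text2text-generation.",
    "parameters": {
        "padding": True,      # values from the question; support for each
        "truncation": True,   # key depends on the task and the
        "max_length": 512,    # transformers version of the container
    },
}

# With a deployed predictor this would be sent as:
# predictor.predict(payload)
```

Per-request parameters avoid redeploying the endpoint when you only want to change tokenization or generation behavior for a single call.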

Asked 2 years ago · 95 views
No answers
