Bedrock fine-tuning ValidationError

While trying to create a custom model fine-tuning job (model Cohere:command), I get this error: "Validation error: After preprocessing, duplicates and large examples were removed.At least 32 examples needed if passing only train file"

The training file contains more than 100 rows in JSONL format. The average length of the 'completion' values is 187 characters, and the average length of the 'prompt' values is 500 characters.

I made multiple attempts with different datasets, all with a similar result.

2 Answers

The error indicates that after preprocessing, the number of examples in the training file dropped below the minimum required for model training. A few things to check:

  1. Make sure the training file still contains at least 32 valid examples after preprocessing. As the error message states, duplicates and overly large examples are removed during this step, and empty rows are dropped as well, so the count that matters is what remains afterward.
  2. Check that each example matches the documented format. Each example should be a JSON object on its own line with non-empty 'prompt' and 'completion' fields.
  3. Try increasing the number of examples in the training file. AWS recommends a minimum of 1000 examples for optimal model performance.

Referring to the AWS documentation on data format and requirements can help troubleshoot issues with example count or format causing validation errors. Let me know if preprocessing the data differently helps resolve the error.
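
To see how many rows are likely to survive before submitting the job, the checks above can be approximated locally. The following is a minimal sketch, not Bedrock's actual preprocessing: the helper names (audit_jsonl_lines, audit_jsonl_file), the exact-match duplicate rule, and the MAX_CHARS cutoff are all assumptions made for illustration. The real size limit is most likely token-based and should be taken from the documentation.

```python
import json

MIN_EXAMPLES = 32   # minimum quoted in the error message
MAX_CHARS = 16000   # ASSUMPTION: placeholder size cutoff, not a documented limit


def audit_jsonl_lines(lines, max_chars=MAX_CHARS):
    """Count rows that pass basic format, duplicate, and size checks."""
    seen = set()
    stats = {"total": 0, "invalid": 0, "empty": 0,
             "duplicate": 0, "too_large": 0, "kept": 0}
    for raw in lines:
        raw = raw.strip()
        if not raw:
            continue  # blank lines are ignored and not counted
        stats["total"] += 1
        try:
            record = json.loads(raw)
        except json.JSONDecodeError:
            stats["invalid"] += 1
            continue
        if not isinstance(record, dict):
            stats["invalid"] += 1
            continue
        prompt = record.get("prompt")
        completion = record.get("completion")
        if not isinstance(prompt, str) or not isinstance(completion, str):
            stats["invalid"] += 1  # missing or non-string field
            continue
        if not prompt.strip() or not completion.strip():
            stats["empty"] += 1
            continue
        key = (prompt, completion)
        if key in seen:  # ASSUMPTION: duplicates are exact prompt/completion matches
            stats["duplicate"] += 1
            continue
        seen.add(key)
        if len(prompt) + len(completion) > max_chars:
            stats["too_large"] += 1
            continue
        stats["kept"] += 1
    return stats


def audit_jsonl_file(path, max_chars=MAX_CHARS):
    """Run the same audit on a JSONL file on disk."""
    with open(path, encoding="utf-8") as handle:
        return audit_jsonl_lines(handle, max_chars)
```

For example, calling audit_jsonl_file("train.jsonl") (the file name is a placeholder) and comparing the "kept" count with MIN_EXAMPLES shows whether the file is likely to clear the threshold. A large "duplicate" or "invalid" count would point to the reason the job is rejected even though the file has more than 100 rows.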

AWS
answered 2 months ago
EXPERT
reviewed 2 months ago

If the issue persists after trying the suggestions above, please open a case with AWS Premium Support for further investigation.

AWS
answered 2 months ago
