Error: The provided entity lists contained only generic or high-frequency words. Please provide additional unique entity names for training

0

I was trying to train Comprehend to identify my custom types. When I submit a training job to identify my custom entities it comes up with the error below.

"The provided entity lists contained only generic or high-frequency words. Please provide additional unique entity names for training."

What does the error really mean?

I had created 5 entity type each having 200 different text and then nearly 40 documents of training data. Each of the training data were store as separate files in the s3 bucket. I provided spearate S3 bucket names for both the entity.csv and for the 40 documents stored as txt files

Any help will be appreciated

posta 2 anni fa473 visualizzazioni
1 Risposta
0

Comprehend will filter out stop words, such as "the", from the provided list of entity names. If Comprehend finds no samples for a given entity type after this filtering, then it will result in the error message mentioned. Please make sure you aren't using stop words for entity names.

AWS
con risposta 2 anni fa

Accesso non effettuato. Accedi per postare una risposta.

Una buona risposta soddisfa chiaramente la domanda, fornisce un feedback costruttivo e incoraggia la crescita professionale del richiedente.

Linee guida per rispondere alle domande