Error: The provided entity lists contained only generic or high-frequency words. Please provide additional unique entity names for training


I was trying to train Comprehend to identify my custom types. When I submit a training job to identify my custom entities it comes up with the error below.

"The provided entity lists contained only generic or high-frequency words. Please provide additional unique entity names for training."

What does the error really mean?

I had created 5 entity types, each with 200 different text entries, and about 40 documents of training data. Each training document was stored as a separate file in the S3 bucket. I provided separate S3 bucket names for the entity.csv file and for the 40 documents stored as txt files.

Any help will be appreciated

Asked 2 years ago, 473 views
1 Answer

Comprehend filters stop words, such as "the", out of the provided list of entity names. If no samples remain for a given entity type after this filtering, the training job fails with the error message above. Please make sure the entries in your entity list are not stop words.
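
As a rough way to check an entity list before submitting it, the sketch below counts, per entity type, how many entries would survive a stop-word filter. It assumes the entity list is a CSV with a `Text,Type` header. The stop-word set and the helper names (`ASSUMED_STOP_WORDS`, `usable_entries_per_type`, `types_without_samples`) are hypothetical and only illustrative, since the exact list Comprehend filters against is not given here.

```python
import csv
import io
from collections import Counter

# Assumption: a small illustrative stop-word set, NOT Comprehend's actual list.
ASSUMED_STOP_WORDS = {"the", "a", "an", "and", "or", "of", "in", "on", "to", "is", "it", "for"}


def usable_entries_per_type(csv_text, stop_words=ASSUMED_STOP_WORDS):
    """Count rows per Type whose Text is not made up solely of stop words."""
    counts = Counter()
    reader = csv.DictReader(io.StringIO(csv_text))
    for row in reader:
        entity_type = row["Type"].strip()
        tokens = row["Text"].lower().split()
        # Register the type even if none of its rows turn out to be usable.
        counts[entity_type] += 0
        if tokens and not all(token in stop_words for token in tokens):
            counts[entity_type] += 1
    return dict(counts)


def types_without_samples(csv_text, stop_words=ASSUMED_STOP_WORDS):
    """Return the entity types left with zero usable entries after filtering."""
    per_type = usable_entries_per_type(csv_text, stop_words)
    return sorted(t for t, n in per_type.items() if n == 0)


# Hypothetical entity list: VENDOR contains only stop words.
sample = """Text,Type
the,DEVICE
Kindle Fire,DEVICE
of,VENDOR
the,VENDOR
"""

print(types_without_samples(sample))
```

Any type this reports is a likely candidate for the error, under the assumptions above. Adding more distinctive entries for that type would be the thing to try first.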

AWS
Answered 2 years ago
