Comprehend.detectDominantLanguage text length & Confidence


Hello,

How long (in minimum number of words) must a correctly spelled text be for AWS SDK Comprehend.detectDominantLanguage to reliably detect its language?

The use case is text fragments detected in travel pictures. How is Comprehend language detection affected when some words in a picture, and thus in the phrase, are not extracted correctly? Will Comprehend guess the misspelled words, or ignore them?

The answer should lead to a pre-processing step, so that only text with a high chance of success is sent to Comprehend. How should the input text be pre-processed (classified) given the detection Confidence from Rekognition? What average Rekognition.Confidence across the words of a whole text is needed for Comprehend.detectDominantLanguage to detect its language successfully?

Thank you,
Mihai

  • Hello,

    Regarding the last question (what average Rekognition.Confidence across the words of a whole text is needed for Comprehend.detectDominantLanguage to detect its language successfully?), I think a better, more practical one would be:

    What are the minimum length and Confidence of a Rekognition LINE from an image for its language to be successfully detected by Comprehend.detectDominantLanguage?

    Sorry for the vagueness of the questions, and thank you,
    Mihai

Asked 8 months ago, viewed 183 times
1 Answer
Accepted Answer

The API requires the input text of a request to contain at least 20 characters. See https://docs.aws.amazon.com/comprehend/latest/APIReference/API_DetectDominantLanguage.html

For questions about accuracy and integration with other services (Amazon Rekognition), please open a support ticket so the question can be investigated with the necessary resources.
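
As a starting point for the pre-processing step asked about, here is a minimal Python sketch. It keeps only the LINE detections from a Rekognition `detect_text` response whose Confidence is above a cut-off, joins them, and skips the Comprehend call when the result is shorter than the 20-character minimum mentioned above. The 90.0 confidence cut-off and the helper names `select_text` and `detect_language` are assumptions made for illustration, not AWS recommendations; the right threshold has to be tuned on your own images.

```python
MIN_CHARS = 20          # minimum input length mentioned in the answer above
MIN_CONFIDENCE = 90.0   # assumed cut-off (Rekognition reports 0-100); tune on your data


def select_text(rekognition_response, min_confidence=MIN_CONFIDENCE, min_chars=MIN_CHARS):
    """Join confident LINE detections; return None if the result is too short."""
    lines = [
        d["DetectedText"]
        for d in rekognition_response.get("TextDetections", [])
        if d.get("Type") == "LINE" and d.get("Confidence", 0.0) >= min_confidence
    ]
    text = " ".join(lines).strip()
    return text if len(text) >= min_chars else None


def detect_language(comprehend, rekognition_response):
    """Return (language_code, score) for the top-scoring language, or None if skipped."""
    text = select_text(rekognition_response)
    if text is None:
        return None  # not enough confident text; do not call Comprehend
    languages = comprehend.detect_dominant_language(Text=text)["Languages"]
    if not languages:
        return None
    best = max(languages, key=lambda lang: lang["Score"])
    return best["LanguageCode"], best["Score"]
```

With boto3 this could be called as `detect_language(boto3.client("comprehend"), boto3.client("rekognition").detect_text(Image=...))`. The Score that Comprehend returns (between 0 and 1) can then be compared against a second threshold of your choosing before the result is trusted.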

AWS
Answered 7 months ago
