Textract doesn't detect few lines from top of the page.

0

Hi,

Sometimes Textract does not extract OCR from the top of the page. The following shows an example: Enter image description here The input image is 300 DPI. Is this a bug or I am missing a pre-process step or setting?

Thanks

已提问 2 年前244 查看次数
1 回答
0

Thanks for bringing up the issue. 300 DIP seems like a reasonable resolution at which OCR should work fine. However, I can confirm some of the details at the top are missing. So, in this case, could you please reach out to the Textract team via a support case citing quality issue - and provide the redacted document to help the team debug what exactly is happening with the document. Thanks.

AWS
Rohan_K
已回答 2 年前
  • Hi,

    I found there is a bug in Textract API. If the page has a barcode or QR code a the bottom of the page, It won't pick up few lines from the top. If I remove the barcode from the page, then it will report back all the the text in the document.

    Thanks.

您未登录。 登录 发布回答。

一个好的回答可以清楚地解答问题和提供建设性反馈,并能促进提问者的职业发展。

回答问题的准则

相关内容