Textract doesn't detect few lines from top of the page.

0

Hi,

Sometimes Textract does not extract OCR from the top of the page. The following shows an example: Enter image description here The input image is 300 DPI. Is this a bug or I am missing a pre-process step or setting?

Thanks

已提問 2 年前檢視次數 244 次
1 個回答
0

Thanks for bringing up the issue. 300 DIP seems like a reasonable resolution at which OCR should work fine. However, I can confirm some of the details at the top are missing. So, in this case, could you please reach out to the Textract team via a support case citing quality issue - and provide the redacted document to help the team debug what exactly is happening with the document. Thanks.

AWS
Rohan_K
已回答 2 年前
  • Hi,

    I found there is a bug in Textract API. If the page has a barcode or QR code a the bottom of the page, It won't pick up few lines from the top. If I remove the barcode from the page, then it will report back all the the text in the document.

    Thanks.

您尚未登入。 登入 去張貼答案。

一個好的回答可以清楚地回答問題並提供建設性的意見回饋,同時有助於提問者的專業成長。

回答問題指南