Is one supposed to grayscale and brightness contrast process the image before sending to textract?

0

Textract results on recognizing basic arithmetic seems to degrade with color

This series of images show Textract failing unusually in all cases except the one where the image has been both grayscale and brightness/contrast (50/50 and 25/25)

Is one supposed to grayscale the image before sending to textract? Should one also apply brightness/contrast?

I assume Textract was trained with grayscale images - so should the service automatically convert the input images to grayscale?

ina
已提問 2 年前檢視次數 574 次
2 個答案
0

Grayscaling and contrast-correcting images before sending to Textract should not be "required" and the service is trained and evaluated on a broad variety of OCR tasks.

However, as you've found, there are definitely cases where adding your own pre-processing if possible can help boost accuracy - sometimes with significant improvement. For example you might also explore skew detection and correction if that's relevant to your images, as I've seen customers benefit from it before.

It's worth mentioning that Amazon Textract is optimized more towards document extraction use-cases, while the Rekognition DetectText API may perform better in situations where the background is busier and there are fewer than 100 words to detect. I'm not sure how well Rekognition performs with handwriting (it's not explicitly mentioned in the doc from what I can see), but might be worth trying out to compare since your images seem to have relatively little text.

AWS
專家
Alex_T
已回答 2 年前
0

Grayscale and brightness contrast processing is not a pre-requisite for using Textract. There may be different use cases where pre-processing will produce better results but it is not required.

已回答 2 年前

您尚未登入。 登入 去張貼答案。

一個好的回答可以清楚地回答問題並提供建設性的意見回饋,同時有助於提問者的專業成長。

回答問題指南