Should one convert an image to grayscale and adjust its brightness/contrast before sending it to Textract?

Textract's results on recognizing basic arithmetic seem to degrade with color.

This series of images shows Textract failing unusually in all cases except the one where the image has been both converted to grayscale and brightness/contrast adjusted (50/50 and 25/25).

Is one supposed to grayscale the image before sending it to Textract? Should one also apply brightness/contrast adjustment?

I assume Textract was trained on grayscale images, so should the service automatically convert input images to grayscale?

ina
asked 2 years ago, 565 views
2 Answers
Grayscaling and contrast-correcting images before sending them to Textract should not be required: the service is trained and evaluated on a broad variety of OCR tasks.

However, as you've found, there are definitely cases where adding your own pre-processing, where possible, can boost accuracy, sometimes significantly. For example, you might also explore skew detection and correction if that's relevant to your images, as I've seen customers benefit from it before.
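As a rough illustration of that kind of pre-processing, here is a minimal sketch that assumes Pillow and boto3 are installed and AWS credentials are configured. The helper names (`percent_to_factor`, `preprocess`, `detect_lines`) are hypothetical, and treating the question's "50/50" as a +50% brightness and +50% contrast adjustment is an assumption, since the thread does not say which tool produced those values.

```python
import io


def percent_to_factor(percent: float) -> float:
    """Map an editor-style adjustment (+50 means 50% more) to a Pillow
    enhancement factor, where 1.0 leaves the image unchanged.

    This mapping is an assumption for illustration, not something
    Textract defines.
    """
    return 1.0 + percent / 100.0


def preprocess(image_bytes: bytes, brightness: float = 50, contrast: float = 50) -> bytes:
    """Return PNG bytes of the image in grayscale with brightness and
    contrast adjusted by the given percentages."""
    # Imported here so the module still loads where Pillow is not installed.
    from PIL import Image, ImageEnhance, ImageOps

    img = Image.open(io.BytesIO(image_bytes))
    img = ImageOps.grayscale(img)  # single-channel "L" mode
    img = ImageEnhance.Brightness(img).enhance(percent_to_factor(brightness))
    img = ImageEnhance.Contrast(img).enhance(percent_to_factor(contrast))

    out = io.BytesIO()
    img.save(out, format="PNG")
    return out.getvalue()


def detect_lines(image_bytes: bytes) -> list[str]:
    """Send image bytes to Textract and return the text of each LINE block."""
    import boto3  # needs AWS credentials and a region to be configured

    client = boto3.client("textract")
    response = client.detect_document_text(Document={"Bytes": image_bytes})
    return [b["Text"] for b in response["Blocks"] if b["BlockType"] == "LINE"]
```

Calling `detect_lines` once on the raw bytes and once on `preprocess(raw_bytes)` gives a side-by-side view of whether the adjustment helps for a particular image. The useful settings are likely to vary from image to image, so the defaults here are only a starting point.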

It's worth mentioning that Amazon Textract is optimized more towards document extraction use cases, while the Rekognition DetectText API may perform better in situations where the background is busier and there are fewer than 100 words to detect. I'm not sure how well Rekognition performs with handwriting (it's not explicitly mentioned in the documentation from what I can see), but it might be worth trying out to compare, since your images seem to have relatively little text.
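One way to run that comparison is sketched below, again assuming boto3 and configured AWS credentials. The function names are hypothetical. The response fields used (`Blocks`, `BlockType`, `Text` for Textract; `TextDetections`, `Type`, `DetectedText` for Rekognition; `Confidence` for both) are the ones the two APIs return for line-level results.

```python
def textract_lines(response: dict) -> list[tuple[str, float]]:
    """Extract (text, confidence) pairs for LINE blocks from a Textract
    DetectDocumentText response."""
    return [
        (block["Text"], block["Confidence"])
        for block in response.get("Blocks", [])
        if block["BlockType"] == "LINE"
    ]


def rekognition_lines(response: dict) -> list[tuple[str, float]]:
    """Extract (text, confidence) pairs for LINE detections from a
    Rekognition DetectText response."""
    return [
        (det["DetectedText"], det["Confidence"])
        for det in response.get("TextDetections", [])
        if det["Type"] == "LINE"
    ]


def compare_services(image_bytes: bytes) -> dict[str, list[tuple[str, float]]]:
    """Run the same image through both services and return their line results."""
    import boto3  # needs AWS credentials and a region to be configured

    textract = boto3.client("textract")
    rekognition = boto3.client("rekognition")

    t_resp = textract.detect_document_text(Document={"Bytes": image_bytes})
    r_resp = rekognition.detect_text(Image={"Bytes": image_bytes})

    return {
        "textract": textract_lines(t_resp),
        "rekognition": rekognition_lines(r_resp),
    }
```

Printing the two lists next to each other for a handful of sample images should show fairly quickly which service reads the arithmetic more reliably.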

AWS
EXPERT
Alex_T
answered 2 years ago
Grayscale and brightness/contrast processing is not a prerequisite for using Textract. Pre-processing may produce better results in some use cases, but it is not required.

answered 2 years ago
