Is one supposed to grayscale and brightness contrast process the image before sending to textract?

0

Textract results on recognizing basic arithmetic seems to degrade with color

This series of images show Textract failing unusually in all cases except the one where the image has been both grayscale and brightness/contrast (50/50 and 25/25)

Is one supposed to grayscale the image before sending to textract? Should one also apply brightness/contrast?

I assume Textract was trained with grayscale images - so should the service automatically convert the input images to grayscale?

ina
질문됨 2년 전575회 조회
2개 답변
0

Grayscaling and contrast-correcting images before sending to Textract should not be "required" and the service is trained and evaluated on a broad variety of OCR tasks.

However, as you've found, there are definitely cases where adding your own pre-processing if possible can help boost accuracy - sometimes with significant improvement. For example you might also explore skew detection and correction if that's relevant to your images, as I've seen customers benefit from it before.

It's worth mentioning that Amazon Textract is optimized more towards document extraction use-cases, while the Rekognition DetectText API may perform better in situations where the background is busier and there are fewer than 100 words to detect. I'm not sure how well Rekognition performs with handwriting (it's not explicitly mentioned in the doc from what I can see), but might be worth trying out to compare since your images seem to have relatively little text.

AWS
전문가
Alex_T
답변함 2년 전
0

Grayscale and brightness contrast processing is not a pre-requisite for using Textract. There may be different use cases where pre-processing will produce better results but it is not required.

답변함 2년 전

로그인하지 않았습니다. 로그인해야 답변을 게시할 수 있습니다.

좋은 답변은 질문에 명확하게 답하고 건설적인 피드백을 제공하며 질문자의 전문적인 성장을 장려합니다.

질문 답변하기에 대한 가이드라인

관련 콘텐츠