- Newest
- Most votes
- Most comments
Grayscaling and contrast-correcting images before sending to Textract should not be "required" and the service is trained and evaluated on a broad variety of OCR tasks.
However, as you've found, there are definitely cases where adding your own pre-processing if possible can help boost accuracy - sometimes with significant improvement. For example you might also explore skew detection and correction if that's relevant to your images, as I've seen customers benefit from it before.
It's worth mentioning that Amazon Textract is optimized more towards document extraction use-cases, while the Rekognition DetectText API may perform better in situations where the background is busier and there are fewer than 100 words to detect. I'm not sure how well Rekognition performs with handwriting (it's not explicitly mentioned in the doc from what I can see), but might be worth trying out to compare since your images seem to have relatively little text.
Grayscale and brightness contrast processing is not a pre-requisite for using Textract. There may be different use cases where pre-processing will produce better results but it is not required.
Relevant content
- asked 2 years ago
- asked a year ago
- asked 2 years ago
- AWS OFFICIALUpdated 2 years ago
- AWS OFFICIALUpdated 8 months ago
- AWS OFFICIALUpdated 2 years ago
Just tried ReKognition... Colored: https://gyazo.com/8fc20ca073ca33e715af9c6348dc49da
Grayscale/Brightness/Contrast: https://gyazo.com/cb16c701b43738b7c533731d2550c901
Complete failure at https://gyazo.com/e3991e6e46b900e16e145a5a7481127e also