AWS OCR and Unicode

0

Does AWS OCR support Unicode characters? I want to scan pages that are written partially in the Apache language, which uses Unicode fonts for accents, tone, nazalizations and at least one non-Roman character, sometimes called the silent L or slashed L.

EricAZ
asked a year ago232 views
1 Answer
0

It would appear that your use case is not covered by Amazon Textract https://aws.amazon.com/textract/faqs/

Q: What type of text can Amazon Textract detect and extract?

Amazon Textract can detect printed text and handwriting from the Standard English alphabet and ASCII symbols. Amazon Textract can extract printed text, forms and tables in English, German, French, Spanish, Italian and Portuguese.

profile picture
EXPERT
Steve_M
answered a year ago

You are not logged in. Log in to post an answer.

A good answer clearly answers the question and provides constructive feedback and encourages professional growth in the question asker.

Guidelines for Answering Questions