2 Respostas
- Mais recentes
- Mais votos
- Mais comentários
1
By formatting, I assume you mean font size and style (e.g. bold, italic)? Currently Textract does not extract information on this type of formatting.
The DetectText API currently provides the following information (source):
- The lines and words of detected text
- The relationships between the lines and words of detected text
- The page that the detected text appears on
- The location of the lines and words of text on the document page
It can also extract tables, forms, and specific information through queries. This page provides a good overview of the output you can expect.
respondido há um ano
0
Thank you very much for your explanation ! Given that Textract has very high accuracy in terms of correctly recognizing the characters, this would be a great feature to add.
respondido há um ano