2 Respuestas
- Más nuevo
- Más votos
- Más comentarios
1
By formatting, I assume you mean font size and style (e.g. bold, italic)? Currently Textract does not extract information on this type of formatting.
The DetectText API currently provides the following information (source):
- The lines and words of detected text
- The relationships between the lines and words of detected text
- The page that the detected text appears on
- The location of the lines and words of text on the document page
It can also extract tables, forms, and specific information through queries. This page provides a good overview of the output you can expect.
respondido hace un año
0
Thank you very much for your explanation ! Given that Textract has very high accuracy in terms of correctly recognizing the characters, this would be a great feature to add.
respondido hace un año
Contenido relevante
- OFICIAL DE AWSActualizada hace un año
- OFICIAL DE AWSActualizada hace 2 años