2 Answers
1
By formatting, I assume you mean font size and style (e.g. bold, italic)? Currently Textract does not extract information on this type of formatting.
The DetectDocumentText API currently returns the following information (source):
- The lines and words of detected text
- The relationships between the lines and words of detected text
- The page that the detected text appears on
- The location of the lines and words of text on the document page
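To make the list above concrete, here is a minimal sketch of how those four pieces of information could be read out of a response. It assumes the response has the `Blocks` structure that Textract returns for text detection (`BlockType`, `Text`, `Page`, `Geometry`, and `CHILD` relationships). The function name `summarize_text_blocks` and the `sample_response` dictionary are hypothetical and hand-written for illustration, not real Textract output.

```python
def summarize_text_blocks(response):
    """Return one entry per LINE block with its page, bounding box, and words."""
    blocks = response.get("Blocks", [])
    by_id = {block["Id"]: block for block in blocks}

    lines = []
    for block in blocks:
        if block.get("BlockType") != "LINE":
            continue

        # A LINE points at its WORD blocks through a CHILD relationship.
        child_ids = [
            child_id
            for rel in block.get("Relationships", [])
            if rel.get("Type") == "CHILD"
            for child_id in rel.get("Ids", [])
        ]
        words = [
            by_id[child_id].get("Text", "")
            for child_id in child_ids
            if child_id in by_id and by_id[child_id].get("BlockType") == "WORD"
        ]

        lines.append(
            {
                "text": block.get("Text", ""),
                # Assumption: default to page 1 if the field is missing.
                "page": block.get("Page", 1),
                "bbox": block.get("Geometry", {}).get("BoundingBox"),
                "words": words,
            }
        )
    return lines


# Hand-written sample in the assumed response shape (not real service output).
sample_response = {
    "Blocks": [
        {"BlockType": "PAGE", "Id": "p1", "Page": 1},
        {
            "BlockType": "LINE",
            "Id": "l1",
            "Page": 1,
            "Text": "Hello world",
            "Geometry": {
                "BoundingBox": {"Left": 0.1, "Top": 0.2, "Width": 0.5, "Height": 0.05}
            },
            "Relationships": [{"Type": "CHILD", "Ids": ["w1", "w2"]}],
        },
        {"BlockType": "WORD", "Id": "w1", "Page": 1, "Text": "Hello"},
        {"BlockType": "WORD", "Id": "w2", "Page": 1, "Text": "world"},
    ]
}

for line in summarize_text_blocks(sample_response):
    print(line["page"], line["text"], line["words"], line["bbox"])
```

With a real document, `response` would come from something like `boto3.client("textract").detect_document_text(Document={"Bytes": data})`. The sample carries no font size or style fields, which is consistent with the limitation described above for Textract.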
It can also extract tables, forms, and specific information through queries. This page provides a good overview of the output you can expect.
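As a rough sketch of the tables, forms, and queries side: with boto3 these go through `analyze_document` rather than the text-detection call, using `FeatureTypes` and `QueriesConfig`. The helper names `build_analyze_request` and `extract_query_answers`, the example question, and the sample response below are all hypothetical. The response shape (a `QUERY` block linked to `QUERY_RESULT` blocks through an `ANSWER` relationship) is my understanding of the documented format, so verify it against real output.

```python
def build_analyze_request(document_bytes, questions):
    """Build keyword arguments for textract.analyze_document (hypothetical helper)."""
    return {
        "Document": {"Bytes": document_bytes},
        "FeatureTypes": ["TABLES", "FORMS", "QUERIES"],
        "QueriesConfig": {"Queries": [{"Text": q} for q in questions]},
    }


def extract_query_answers(response):
    """Map each query text to the list of answer texts found for it."""
    blocks = response.get("Blocks", [])
    by_id = {block["Id"]: block for block in blocks}

    answers = {}
    for block in blocks:
        if block.get("BlockType") != "QUERY":
            continue
        question = block.get("Query", {}).get("Text", "")
        # Assumption: answers are linked through an ANSWER relationship.
        result_ids = [
            result_id
            for rel in block.get("Relationships", [])
            if rel.get("Type") == "ANSWER"
            for result_id in rel.get("Ids", [])
        ]
        answers[question] = [
            by_id[result_id].get("Text", "")
            for result_id in result_ids
            if result_id in by_id
        ]
    return answers


# Hand-written sample in the assumed response shape (not real service output).
sample_analysis = {
    "Blocks": [
        {
            "BlockType": "QUERY",
            "Id": "q1",
            "Query": {"Text": "What is the invoice number?"},
            "Relationships": [{"Type": "ANSWER", "Ids": ["r1"]}],
        },
        {"BlockType": "QUERY_RESULT", "Id": "r1", "Text": "INV-001"},
    ]
}

request = build_analyze_request(b"...", ["What is the invoice number?"])
print(request["FeatureTypes"])
print(extract_query_answers(sample_analysis))
```

Against the real service, the call would look like `boto3.client("textract").analyze_document(**request)`, with the document bytes read from your own file.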
Answered a year ago
0
Thank you very much for your explanation! Given that Textract recognizes characters with very high accuracy, this would be a great feature to add.
Answered a year ago