2 個答案
- 最新
- 最多得票
- 最多評論
1
By formatting, I assume you mean font size and style (e.g. bold, italic)? Currently Textract does not extract information on this type of formatting.
The DetectText API currently provides the following information (source):
- The lines and words of detected text
- The relationships between the lines and words of detected text
- The page that the detected text appears on
- The location of the lines and words of text on the document page
It can also extract tables, forms, and specific information through queries. This page provides a good overview of the output you can expect.
已回答 1 年前
0
Thank you very much for your explanation ! Given that Textract has very high accuracy in terms of correctly recognizing the characters, this would be a great feature to add.
已回答 1 年前