1回答
- 新しい順
- 投票が多い順
- コメントが多い順
1
Textract did update the table model to support merged_cells and table_headers. https://aws.amazon.com/about-aws/whats-new/2022/03/amazon-textract-updates-tables-check-detection/
The update adds a new BlockType called "MERGED_CELLS" and Relationships Type "MERGED_CELL" and an EntityType "COLUMN_HEADER". If you don't need those, you can ignore them.
Outside of those additions the response is the same as the "older" one with all CELLs of a TABLE being the CHILD Relationship. See: https://docs.aws.amazon.com/textract/latest/dg/how-it-works-tables.html
I recommend using https://pypi.org/project/amazon-textract-response-parser/ for parsing the response in Python.
回答済み 2年前
関連するコンテンツ
- 質問済み 9ヶ月前
- AWS公式更新しました 1年前