1 個回答
- 最新
- 最多得票
- 最多評論
1
Textract did update the table model to support merged_cells and table_headers. https://aws.amazon.com/about-aws/whats-new/2022/03/amazon-textract-updates-tables-check-detection/
The update adds a new BlockType called "MERGED_CELLS" and Relationships Type "MERGED_CELL" and an EntityType "COLUMN_HEADER". If you don't need those, you can ignore them.
Outside of those additions the response is the same as the "older" one with all CELLs of a TABLE being the CHILD Relationship. See: https://docs.aws.amazon.com/textract/latest/dg/how-it-works-tables.html
I recommend using https://pypi.org/project/amazon-textract-response-parser/ for parsing the response in Python.
已回答 2 年前
相關內容
- AWS 官方已更新 1 年前
- AWS 官方已更新 8 個月前