1 Answer
- Newest
- Most votes
- Most comments
1
Textract Query may not be the best fit here given the structure of the document. You'll end up with inconsistent results. Here are the results of my tests so far with Query:
- What is the value of AB? --> <empty>
- What is the value of AF--> 12345
- What is the value of AA --> Capital souscrit non appelé
- What is the value of Capital souscrit non appelé? --> LLES
Once approach could be to leverage Tables and the merged cell feature that identifies cells that are merged horizontally or vertically. The screenshot below shows what I was able to get while testing the sample in the demo console using Tables.
Please check out the blog below for an example of how to use the merged cell construct in the AnalyzeDocument API's response. https://aws.amazon.com/blogs/machine-learning/merge-cells-and-column-headers-in-amazon-textract-tables/
answered a year ago
Relevant content
- asked a year ago
- asked 6 months ago
- asked 18 days ago
- AWS OFFICIALUpdated 2 months ago
- AWS OFFICIALUpdated 3 months ago
- AWS OFFICIALUpdated 2 years ago