Textract missing some columns, tables


For certain documents, Textract appears to be missing some columns, skipping entire tables, or incorrectly identifying tables. I have attempted to use some different pre-processing methods such as extracting each page to an image file and rebuilding the PDFs, but this has not yielded any better results. Any suggestions are appreciated. Here are some examples, you can see the tables as identified by Textract highlighted in red:

Example #1 Example #2

asked 7 months ago218 views
1 Answer

Thank you for using Amazon Textract. As a managed machine learning service, we are continuously improving the quality of our models and releasing new features. In order to help us improve our models for your documents, please open a customer support ticket and share details to help us analyze further. Additionally, please look out for announcements regarding our model quality updates and new feature announcements that are announced on the AWS Textract what’s new post channel.

answered 7 months ago

You are not logged in. Log in to post an answer.

A good answer clearly answers the question and provides constructive feedback and encourages professional growth in the question asker.

Guidelines for Answering Questions