Textract Table Odd Behavior on 1 out of 5 pages

I'm using Textract to parse table data in PDFs. Most of the pages are parsed out correctly:

[screenshot: table from a correctly parsed page]

But the last page table has the columns split out in strange ways:

[screenshot: last-page table with the columns split]

I've got a config-driven process that relies on the index of a column to determine where to get the needed values. It breaks when the index for a value suddenly changes in one part of the table. Any ideas on how to get around this? I haven't seen this happen previously and it's kind of a bummer.
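One possible way to make a config-driven lookup less sensitive to shifting column positions is to key the config on header text instead of a fixed index, and resolve the index per table. The sketch below is only an illustration under assumptions: it presumes each table has already been converted to a list of rows of cell text, that every page's table carries a header row Textract reads consistently, and that the odd page differs only by extra empty columns. The function names and sample data are hypothetical, not part of Textract or the original process, and this would not help if the header cells themselves are split.

```python
from typing import Dict, List, Optional


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so header matching is forgiving."""
    return " ".join(text.lower().split())


def build_header_index(header_row: List[str]) -> Dict[str, int]:
    """Map each non-empty header cell to its position in this table."""
    return {normalize(cell): i for i, cell in enumerate(header_row) if cell.strip()}


def get_value(row: List[str], header_index: Dict[str, int], column_name: str) -> Optional[str]:
    """Return the cell under the named column, or None if it cannot be found."""
    idx = header_index.get(normalize(column_name))
    if idx is None or idx >= len(row):
        return None
    return row[idx].strip()


# Hypothetical tables: the second one has a spurious empty column,
# so "Amount" sits at index 3 instead of index 2.
normal_page = [["Date", "Description", "Amount"], ["2021-01-05", "Widget", "10.00"]]
odd_page = [["Date", "", "Description", "Amount"], ["2021-01-06", "", "Gadget", "12.50"]]

amounts = []
for table in (normal_page, odd_page):
    index = build_header_index(table[0])  # resolved per table, not hard-coded
    for row in table[1:]:
        amounts.append(get_value(row, index, "Amount"))

print(amounts)  # ['10.00', '12.50']
```

With this shape the config would store column names such as "Amount" rather than positions, and a `None` result can be logged as a signal that a page was parsed differently.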

asked 2 years ago · 242 views
1 Answer

Hi, thanks for reaching out. Could you please provide your document so we can reproduce on our end? If there is sensitive information in the document, please file a customer support ticket.

AWS
answered 2 years ago
