Textract Table Odd Behavior on 1 out of 5 pages

I'm using Textract to parse table data in PDFs. Most of the pages are parsed out correctly:

[Image: screenshot of a correctly parsed table]

But the last page table has the columns split out in strange ways:

[Image: screenshot of the last page's table with columns split incorrectly]

I've got a config-driven process that relies on the index of a column to determine where to get the needed values. That breaks when the column index for a value suddenly changes in one part of the table. Any ideas on how to get around this? I haven't seen this happen previously and it's kind of a bummer.
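For illustration, one possible way to make the lookup less sensitive to shifted columns is to resolve each field by its header text instead of a fixed position. This is only a minimal sketch: it assumes the table rows have already been pulled out of the Textract response as lists of strings and that every page's table still has a header row. The function names, the `config` mapping, and the sample rows are all hypothetical.

```python
from typing import Dict, List


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so header matching tolerates OCR spacing."""
    return " ".join(text.lower().split())


def resolve_columns(header_row: List[str], wanted: Dict[str, str]) -> Dict[str, int]:
    """Map each config field name to the index of the column with a matching header."""
    positions: Dict[str, int] = {}
    for index, cell in enumerate(header_row):
        # Keep the first occurrence if a header text appears more than once.
        positions.setdefault(normalize(cell), index)

    resolved: Dict[str, int] = {}
    for field, header in wanted.items():
        key = normalize(header)
        if key not in positions:
            raise KeyError(f"header {header!r} not found in {header_row!r}")
        resolved[field] = positions[key]
    return resolved


def extract(rows: List[List[str]], wanted: Dict[str, str]) -> List[Dict[str, str]]:
    """Treat rows[0] as the header and return one dict per data row."""
    columns = resolve_columns(rows[0], wanted)
    return [
        {field: (row[i] if i < len(row) else "") for field, i in columns.items()}
        for row in rows[1:]
    ]


# Hypothetical config: field name -> header text (instead of field name -> index).
config = {"date": "Date", "amount": "Amount"}

# Hypothetical rows: the second page has an extra, empty column that shifts the indexes.
page_ok = [["Date", "Description", "Amount"], ["2021-01-05", "Widget", "10.00"]]
page_shifted = [["Date", "", "Description", "Amount"], ["2021-01-06", "", "Gadget", "12.50"]]

result_ok = extract(page_ok, config)
result_shifted = extract(page_shifted, config)
```

With this approach, the extra column on the shifted page no longer changes which values are picked up. It would not help if the page has no header row or if a single value is itself split across two cells.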

asked 2 years ago, 242 views
1 Answer

Hi, thanks for reaching out. Could you please provide your document so we can reproduce the issue on our end? If the document contains sensitive information, please file a customer support ticket instead.

AWS
answered 2 years ago
