Textract Table Odd Behavior on 1 out of 5 pages

I'm using Textract to parse table data in PDFs. Most of the pages are parsed out correctly:

[Screenshot: table parsed correctly]

But the last page table has the columns split out in strange ways:

[Screenshot: last-page table with columns split incorrectly]

I've got a config-driven process that relies on the index of a column to determine where to get the needed values. That breaks when the index for a value suddenly changes in one part of the table. Any ideas on how to get around this? I haven't seen this happen previously and it's kind of a bummer.
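To make the failure mode concrete, here is a minimal sketch of one possible way around it: looking values up by header label instead of by a fixed column index. It assumes each table has already been pulled out of the Textract response as a list of rows, each row a list of cell strings, with the header in the first row. The function names (`normalize`, `build_header_map`, `get_value`) and the sample data are hypothetical, not part of Textract or of the process described above.

```python
def normalize(text):
    """Collapse whitespace and lowercase so 'Amount ' matches 'amount'."""
    return " ".join(text.split()).lower()


def build_header_map(header_row):
    """Map each non-empty header label to its position in this table."""
    return {normalize(cell): i for i, cell in enumerate(header_row) if cell.strip()}


def get_value(row, header_map, column_name, default=None):
    """Fetch a cell by header label rather than by a fixed index."""
    idx = header_map.get(normalize(column_name))
    if idx is None or idx >= len(row):
        return default
    return row[idx]


# Hypothetical tables: the second has a spurious empty column,
# similar in spirit to the split seen on the last page.
good_page = [["Date", "Description", "Amount"],
             ["2021-01-05", "Widget", "10.00"]]
split_page = [["Date", "", "Description", "Amount"],
              ["2021-01-06", "", "Gadget", "12.50"]]

for table in (good_page, split_page):
    header_map = build_header_map(table[0])  # rebuilt per table, so shifts are absorbed
    for row in table[1:]:
        print(get_value(row, header_map, "Amount"))
```

With this sample data, a fixed index of 2 would return `10.00` for the first table but `Gadget` for the second, while the header lookup returns the amount in both. This only helps when the header labels themselves survive extraction intact; if a header cell is split or merged as well, it would need additional handling.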

asked 2 years ago, 242 views
1 Answer

Hi, thanks for reaching out. Could you please provide your document so we can reproduce the issue on our end? If the document contains sensitive information, please file a customer support ticket instead.

AWS
answered 2 years ago
