Specify pages to extract from a PDF with the C# AWS SDK in Textract

I have a program that calls Textract to get tables from multi-page PDFs, and it has been working great so far. The problem is that I now have PDFs where I only need the tables on specific pages, and I am having trouble figuring out how to set the "Pages" property in the QueriesConfig of the StartDocumentAnalysisRequest. A simple example, starting from the StartDocumentAnalysisRequest level, would be sufficient. The program is written in C# using Amazon.Textract and Amazon.Textract.Model.

Thank you!

1 Answer

Thank you for using Textract, and sorry to hear that you are facing issues. Currently, the Pages parameter applies only to the QUERIES feature type; the TABLES feature always runs on all pages. The recommendation is to split the document and then call Textract with the TABLES feature only for the pages you are interested in.
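For reference, here is a minimal sketch of where Pages sits in the C# request, in case it helps. In Amazon.Textract.Model the property lives on each Query inside QueriesConfig.Queries and takes a list of strings such as "2" or "4-6". The bucket name, object key, query text, and alias below are placeholders made up for illustration, and the class and method names are hypothetical; this has not been run against your documents.

```csharp
using System;
using System.Collections.Generic;
using System.Threading.Tasks;
using Amazon.Textract;
using Amazon.Textract.Model;

public static class TextractPagesExample
{
    // Builds a request in which the query is restricted to certain pages.
    // Bucket, key, query text and alias are hypothetical placeholders.
    public static StartDocumentAnalysisRequest BuildRequest(string bucket, string key)
    {
        return new StartDocumentAnalysisRequest
        {
            DocumentLocation = new DocumentLocation
            {
                S3Object = new S3Object { Bucket = bucket, Name = key }
            },
            // Pages only affects QUERIES; TABLES is still evaluated on every page.
            FeatureTypes = new List<string> { "TABLES", "QUERIES" },
            QueriesConfig = new QueriesConfig
            {
                Queries = new List<Query>
                {
                    new Query
                    {
                        Text = "What is the invoice total?", // placeholder question
                        Alias = "INVOICE_TOTAL",             // placeholder alias
                        // Page 2 and pages 4 through 6; entries are strings.
                        Pages = new List<string> { "2", "4-6" }
                    }
                }
            }
        };
    }

    public static async Task Main()
    {
        // Uses the default credential and region resolution of the SDK.
        using var client = new AmazonTextractClient();
        var request = BuildRequest("my-bucket", "docs/report.pdf");
        var response = await client.StartDocumentAnalysisAsync(request);
        Console.WriteLine($"Started job {response.JobId}");
    }
}
```

With this request the query is answered only from the listed pages, while table blocks are still returned for the whole document. To limit tables themselves, the document would have to be split first, or the returned table blocks filtered by their Page value on the client side.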

AWS
answered 2 years ago
