Specify pages to extract from pdf with C# AWS SDK in Textract

0

I have a program written which performs a call to Textract to get tables in multi-page pdfs. This has been working great so far. The problem I have run into is that I now have pdfs where I only need certain tables on specific pages, and I am having trouble figuring out how to set the "Pages" property in the QueriesConfig in the StartDocumentAnalysisRequest. A simple example of this, from the StartDocumentAnalysisRequest level, would be sufficient. The program is written in C# using Amazon.Textract and Amazon.Textract.Model

Thank you!

1 回答
0

Thank you for using Textract. Sorry to hear that you are facing issues. Currently, the pages parameter is applicable only for QUERIES feature type. TABLES feature is for all pages. A recommendation is to split the document and then only call Textract with TABLES feature for the pages that you are interested in.

AWS
已回答 2 年前

您未登录。 登录 发布回答。

一个好的回答可以清楚地解答问题和提供建设性反馈,并能促进提问者的职业发展。

回答问题的准则