Textract Bulk Uploader via Boto API

0

I have recently seen the news on the console Textract tool for Bulk Upload. Is there a way to use this service via the API? Like send a list via the startDocumentAnalysis?

profile picture
Zac Dan
asked a year ago470 views
2 Answers
2

Hi - It seems currently it is an AWS Management Console tool. It will help to evaluate the capabilities of Amazon Textract, using Bulk Document Uploader feature on the Amazon Textract console and enables you to quickly process your own set of documents without writing any code.

profile pictureAWS
EXPERT
answered a year ago
  • Got it so there is not way to bulk upload via the API rather than just a loop to call the endpoint for each file individually? Was hoping to batch my documents and get a response like that of the Bulk Uploader.

1
Accepted Answer

Hi, @AWS-user-Nitin is correct, it is not a new API, hence not a feature of the AWS CLI. The Textract Bulk Uploader AWS Web Console UI does actually call the StartDocumentAnalysis API multiple times and then for all the documents to finish. If you are looking to process larger number of documents, take a look at the https://github.com/aws-solutions-library-samples/guidance-for-low-code-intelligent-document-processing-on-aws, which offers a way to setup a workflow for even higher number of documents. Let me know what your use case is, file a ticket against the repo and I can provide a sample for that.

AWS
answered a year ago
  • Will do! Work flow is to take a large number of documents as a batch, process them with Textract + post processing lambda, upload into Dynamo with each row as a document record, and then summarize all records for a batch into a csv/excel.

You are not logged in. Log in to post an answer.

A good answer clearly answers the question and provides constructive feedback and encourages professional growth in the question asker.

Guidelines for Answering Questions