Regarding Amazon Textract big file process

0

I want to use Amazon Textract process my PDF file. My file have lot of images so file size expecting 50MB to 300MB what is the max file size I can process use Amazon Textract?. One more question also If Amazon Textract not support MS Word(doc,docx format) how to process those file , Do I need to convert to PDF and process ? Please advise Thanks in advance

2개 답변
1

You can find the Textract limits on the AWS documentation https://docs.aws.amazon.com/textract/latest/dg/limits.html

For PDF the limits are 10MB and 1 page for synchronous operations. 500MB and 3000 pages for asynchronous operations.

About MS Word... are you sure that you need and OCR? You can access directly to the text of doc and docx files without using a OCR.

답변함 2년 전
  • Correction: It is actually 1 page for synchronous operations, not 10. Some customers benefit from the Tables, Forms or Queries features even for Word documents or use the OCR geometry information. You can convert to PDF and then run against Textract.

  • Thanks @Martin Schade AWS fixed.

0

Thanks for quick response. As per my understanding only single page PDF we can go for synchronous operations. More than one page PDF we need to go for asynchronous irrespective of file size. Is it correct? Please advise

답변함 2년 전

로그인하지 않았습니다. 로그인해야 답변을 게시할 수 있습니다.

좋은 답변은 질문에 명확하게 답하고 건설적인 피드백을 제공하며 질문자의 전문적인 성장을 장려합니다.

질문 답변하기에 대한 가이드라인

관련 콘텐츠