Questions tagged with Amazon Textract

Amazon Textract is a machine learning (ML) service that automatically extracts text, handwriting, and data from scanned documents.

347 results
Textract is not detecting BlockType 'QUERY' or 'QUERY_RESULT' in some PDF files. I uploaded them in the AWS Textract web console and got output for the query question. But same ...
1
answers
1
votes
12
views
asked 13 hours ago
Hi, I need help with getting the Python code for extracting section_headers from a multi-page PDF.
1
answers
0
votes
28
views
asked 5 days ago
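For the section-header question above, a minimal Python sketch might look like the following. It assumes the document was analyzed with the `LAYOUT` feature type, so the response contains `LAYOUT_SECTION_HEADER` blocks whose `CHILD` relationships point at `LINE` blocks, and that the `Blocks` lists from every page of results have already been merged into one list (multi-page PDFs normally go through the asynchronous API). The function name `section_headers` is hypothetical, and the code works on plain dictionaries rather than calling AWS.

```python
def section_headers(blocks):
    """Return (page, text) pairs for each LAYOUT_SECTION_HEADER block.

    `blocks` is assumed to be the merged "Blocks" list from a Textract
    analysis run with the LAYOUT feature type (an assumption of this sketch).
    """
    # Index every block by Id so child references can be resolved.
    by_id = {block["Id"]: block for block in blocks}
    headers = []
    for block in blocks:
        if block.get("BlockType") != "LAYOUT_SECTION_HEADER":
            continue
        # Collect the Ids of all CHILD blocks of this header.
        child_ids = [
            child_id
            for rel in (block.get("Relationships") or [])
            if rel.get("Type") == "CHILD"
            for child_id in rel.get("Ids", [])
        ]
        # Join the text of the LINE children in the order they are listed.
        lines = [
            by_id[cid].get("Text", "")
            for cid in child_ids
            if cid in by_id and by_id[cid].get("BlockType") == "LINE"
        ]
        # Page may be absent on single-page results; default to 1.
        headers.append((block.get("Page", 1), " ".join(lines)))
    return headers
```

Keeping the parsing separate from the AWS calls means it can be checked against a saved JSON response before it is wired into a job that fetches results.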
I am trying to build a document comparison application as a part of my work. The documents I work with contain a lot of text and tabular data. They are PDF files of around 100 pages. I want the a...
1
answers
0
votes
40
views
asked a month ago
Hi, I used the Textract bulk document uploader to process over 9k documents. Now I need to download the processed files. However, I found I can only download 50 files at a time. Is there any way to downlo...
Accepted Answer
Amazon Textract
1
answers
0
votes
30
views
asked a month ago
Hello, Amazon Textract is quite a useful service. Pricing is clearly articulated here: https://aws.amazon.com/textract/pricing/ The only challenge is how to track utilization. When I look ...
Accepted Answer
Amazon Textract
1
answers
0
votes
60
views
asked a month ago
1. Assume we have 50-250 data points that need to be extracted from PDF files. Each PDF file may be 4-15 pages long. 2. The format and layout of each PDF file may be different. A data point we're searching fo...
2
answers
0
votes
41
views
asked a month ago
Amazon Textract's pricing page says that the Free Tier lasts for 3 months, but does that mean 3 full months from the time you start using the service, or 3 calendar months? I started a free trial in the...
1
answers
0
votes
37
views
asked a month ago
I am developing a Bank Statement PDF Converter that converts PDF files into CSV format. I have resolved many bugs in the output and am nearing completion. However, during testing with sample files, I...
1
answers
0
votes
50
views
asked a month ago
Issue: Textract identifies an enumerated paragraph number as a separate layout box and misses the enumeration. Example: ![phone contract example - `layout text 2` should be part of `layout text `](/medi...
Accepted Answer
Amazon Textract
1
answers
0
votes
31
views
AWS
asked a month ago
I'm implementing a form data capturer from images using AWS Textract, but ... ![Simple checkbox form](/media/postImages/original/IM8vx6TyPGRNCBEe4HcTl2ng) As you can see in the AWS test page, the mo...
2
answers
0
votes
44
views
asked 2 months ago
It seems Textract is having a problem extracting this part of the table (green box) ![Enter image description here](/media/postImages/original/IMugtZUKWjSDCAfESDxTv8IQ) The only data that Textract s...
1
answers
0
votes
70
views
asked 2 months ago
I am attempting to use TextractAsyncClient in Kotlin, but I can't find it. I can find it in Java and can make do if necessary, but is there really no native Kotlin implementation?
1
answers
0
votes
34
views
asked 2 months ago