Questions tagged with Amazon Textract
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
I started exploring Amazon Textract for my use case and Queries and possibly Custom Queries will be an important part of the solution.
I went through the pricing documentation, but could not find...
2
answers
0
votes
333
views
asked 5 months agolg...
I am using a Python lambda which is triggered by the Textract SNS message to read the results of my document analysis. Most of my PDFs that I process on a daily basis are only a few pages long, so the...
1
answers
0
votes
189
views
asked 5 months agolg...
Use Case : We are extracting data from a table to convert it into a json file
Issue : Despite of bold and clear letters , textract is unable to read ' X ' from the table elements (pls check the...
1
answers
0
votes
232
views
asked 6 months agolg...
I'm looking to start a project focused on text/meaning extraction from semi-structured to unstructured documents. There are specific categories of data that I'm looking for, but they may be presented...
2
answers
0
votes
774
views
asked 6 months agolg...
Hey, we were interested in using Amazon Textract to extract tables from our financial documents. However, we noticed that it falters when the document has watermarkd. Does AWS offer document cleaning...
1
answers
0
votes
242
views
asked 6 months agolg...
Hello,
Currently I have an LLM chatbot running on the sample code from streamlit via this workshop:...
1
answers
0
votes
291
views
asked 6 months agolg...
New to Textract. I am using the PHP SDK to access Amazon Textract for DocumentTextDetection (OCR) processing.
This is working so far by downloading the extracted text as local text file. However,...
3
answers
0
votes
569
views
asked 7 months agolg...
I had document of 4 pages where it was of some 200KB and the data got extracted correctly up to 3 pages but unable to extract the data from last page that is page 4 whereas when I uploaded the same...
1
answers
0
votes
148
views
asked 7 months agolg...
While using Textract python API for extracting data from some scanned pdf's the signatures are miss classified as line/word whereas same pdf when uploaded and tested from console its able to identify...
1
answers
0
votes
133
views
asked 7 months agolg...
Hello. We are trying to develop an app for which we want to use Textract to perform OCR on documents, but when uploading PDF documents to a bucket via the API it returns a JSON file with only the...
Accepted AnswerAmazon Textract
3
answers
0
votes
293
views
asked 7 months agolg...
For certain documents, Textract appears to be missing some columns, skipping entire tables, or incorrectly identifying tables. I have attempted to use some different pre-processing methods such as...
1
answers
0
votes
222
views
asked 7 months agolg...
We are struggling with textract query results. The OCR is working correctly, as the text is pulling accurately but the queries are returning incorrect results. We are reviewing education documents and...
2
answers
0
votes
219
views
asked 7 months agolg...