- Newest
- Most votes
- Most comments
Just tried the sample invoice and it worked fine. It can also happen that the Console session already timed out, then you need to refresh the web page and log in again. Besides the recommendation from AlexR, you can also use the https://github.com/aws-samples/amazon-textract-textractor/, which makes it easier calling Textract and working on the response.
I cannot access the pdf you linked, but if you are using the Console in AWS, you might need to create the s3 bucket first, as shown bellow for the Region you are trying to extract the text from the pdf.
In alternative you can use the AWS CLI or the SDK to call the Textact APIs: https://docs.aws.amazon.com/textract/latest/dg/analyzing-document-text.html
PS: if you think this answer is the one what helped you, please Accept it
I have updated the question with the link for the file and webpage for the Demo Service. What I meant was using demo service of Textract with the pdf. Pdf file doesn't work, only image files.
Hi Tushar, did you try again after creating the s3 bucket with the pdf?
hi there, did you solve this issue . i also face the same problem when trying to upload a pdf and try to process it with the demo service. thanks for infos regards steffen
Relevant content
- asked 2 years ago
- AWS OFFICIALUpdated 3 years ago
- AWS OFFICIALUpdated a year ago
- AWS OFFICIALUpdated 6 months ago
- AWS OFFICIALUpdated 6 months ago
I have updated the question with the link for the file and webpage for the Demo Service. What I meant was using demo service of Textract with the pdf. Pdf file doesn't work, only image files. Anyway, what about using Java, is there any GitHub resource similar like for Python that you have mentioned?
Hi Tushar, did you try again after creating the s3 bucket with the pdf?
The Textractor tool offers a command line interface, which you could use or you can check out our Java SDK samples for calling Textract: https://docs.aws.amazon.com/sdk-for-java/latest/developer-guide/java_textract_code_examples.html