Sample test service of Textract not working with PDFs, only with image files

1

I was trying the demo service of AWS Textract with my invoices, which are in '.pdf' format. When I uploaded one, it didn't work and no results were shown. So I tried an invoice in image format (.png, .jpg, etc.), and it works! The page says the service supports PDF, PNG, and other formats, so why didn't it work for PDFs?

The demo service of AWS Textract is this: https://us-east-1.console.aws.amazon.com/textract/home?region=us-east-1#/demo

You can view the invoice PDF here: https://www.dropbox.com/s/082rrvbcmmruovv/PFI%20050.pdf?dl=0

asked a year ago · 1011 views
3 Answers
1

I just tried the sample invoice and it worked fine. It is possible that your Console session had already timed out, in which case you need to refresh the web page and log in again. Besides the recommendation from AlexR, you can also use https://github.com/aws-samples/amazon-textract-textractor/, which makes it easier to call Textract and work with the response.

AWS
answered a year ago
1

I cannot access the PDF you linked, but if you are using the Console in AWS, you might need to create the S3 bucket first for the Region in which you are trying to extract the text from the PDF.

Alternatively, you can use the AWS CLI or the SDK to call the Textract APIs: https://docs.aws.amazon.com/textract/latest/dg/analyzing-document-text.html
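As a rough illustration of the SDK route, here is a minimal sketch using boto3 and the asynchronous text detection operations, which read the PDF from an S3 bucket. The bucket name, object key, and the helper names `extract_lines` and `detect_pdf_text` are placeholders of mine, not something from the question, and I am assuming your credentials are already configured and the PDF has been uploaded to a bucket in the same Region.

```python
import time


def extract_lines(blocks):
    """Return the text of every LINE block, in the order Textract returned them."""
    return [b["Text"] for b in blocks if b.get("BlockType") == "LINE"]


def detect_pdf_text(bucket, key, region="us-east-1", poll_seconds=5):
    """Run asynchronous text detection on a PDF stored in S3 and return its lines.

    `bucket` and `key` are hypothetical placeholders for your own S3 location.
    """
    import boto3  # imported lazily so extract_lines works without boto3 installed

    client = boto3.client("textract", region_name=region)
    job = client.start_document_text_detection(
        DocumentLocation={"S3Object": {"Bucket": bucket, "Name": key}}
    )
    job_id = job["JobId"]

    # Poll until the job leaves the IN_PROGRESS state.
    while True:
        result = client.get_document_text_detection(JobId=job_id)
        if result["JobStatus"] != "IN_PROGRESS":
            break
        time.sleep(poll_seconds)

    if result["JobStatus"] not in ("SUCCEEDED", "PARTIAL_SUCCESS"):
        raise RuntimeError(f"Textract job ended with status {result['JobStatus']}")

    blocks = list(result.get("Blocks", []))
    # Results may be paginated; follow NextToken until it is absent.
    while "NextToken" in result:
        result = client.get_document_text_detection(
            JobId=job_id, NextToken=result["NextToken"]
        )
        blocks.extend(result.get("Blocks", []))
    return extract_lines(blocks)
```

You would call it with something like `detect_pdf_text("my-invoice-bucket", "PFI 050.pdf")`, replacing both arguments with your own values. If this works while the demo page does not, the problem is more likely with the Console session or bucket setup than with the PDF itself.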

PS: if you think this answer is the one that helped you, please Accept it.

AWS
answered a year ago
  • I have updated the question with links to the file and to the demo service page. What I meant was using the Textract demo service with the PDF. The PDF file doesn't work, only image files do.

  • Hi Tushar, did you try the PDF again after creating the S3 bucket?

0

Hi there, did you solve this issue? I am facing the same problem when I upload a PDF and try to process it with the demo service. Thanks for any info. Regards, Steffen

sthe
answered 10 months ago
