Thanks for using Textract and reaching out. Just to check: are you using a paginated client to get all the results? Textract returns at most 1,000 blocks per response, so the results for page 4 might be in a later page of the paginated output. If that's not the case, have you tried splitting the fourth page out of the PDF and sending that page to Textract on its own to see if it returns any results? It is possible that if there is invalid content (e.g. a JPEG 2000 image) on the page, Textract would fail to extract content from the whole page.
Thanks for the response.
To get the results, I'm just grabbing the blocks. I haven't used the paginated feature yet. I'll have to take a look into that with the 'next token' etc.
Otherwise, yes. I'll try the fourth page on its own to see if that's where the issue is occurring.
Thanks!
I just tried it and you were exactly right. I had to grab the 'NextToken' to get the rest of the paginated results. Thanks again for your help!
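For anyone landing here with the same problem, the fix can be sketched roughly as below. This is a minimal sketch, not the exact code from this thread: it assumes the asynchronous `GetDocumentTextDetection` API called through a boto3 Textract client, and `get_all_blocks` and `job_id` are hypothetical names used only for illustration.

```python
def get_all_blocks(client, job_id, max_results=1000):
    """Collect all blocks for an async Textract job by following NextToken.

    Assumes `client` exposes get_document_text_detection, as a boto3
    Textract client does. `job_id` is the ID of an already finished job.
    """
    blocks = []
    next_token = None
    while True:
        kwargs = {"JobId": job_id, "MaxResults": max_results}
        # Only send NextToken on follow-up requests.
        if next_token:
            kwargs["NextToken"] = next_token
        response = client.get_document_text_detection(**kwargs)
        blocks.extend(response.get("Blocks", []))
        # A missing NextToken means this was the last page of results.
        next_token = response.get("NextToken")
        if not next_token:
            return blocks
```

With boto3 this would be called as something like `get_all_blocks(boto3.client("textract"), job_id)`. If the job was started with document analysis instead, the same loop should apply with `get_document_analysis` in place of `get_document_text_detection`.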