Hey! This one is a bit tricky, but if you understand how the JSON response is structured, you should be able to post-process the AnalyzeDocument API Json Response to get the data you need.
- The AnalyzeDocument API will return all the text it found in the document + the FeatureTypes you requested (In this case Forms)
- Each word is represented by an ID, and this ID is present in the Forms relationships.
- You will first have to process and save the Form relationship results
- Then you will have to delete from the response, the IDs of the words that correspond to the forms and will obtain the remaining raw text you are looking for.
Take a look at the Textractor aws sample, which can help you process the JSON results!
Hope this helps.
Send RAW-EMail in iOS-App with SWIFTasked 3 years ago
How to extract Tables and form data using textractasked 4 months ago
Textract - Extract form key values in reading orderasked a month ago
Text data cleaning in databrewasked 6 months ago
Trouble in node.js sending data from html form to the server.asked 7 months ago
[Announcement] Amazon Textract adds synchronous support for single page PDF documents and support for PDF documents containing JPEG 2000 encoded imagesasked 10 months ago
Text data type column not loading data for the values with hyphen and special charcterasked 7 months ago
Textract Form data + Raw dataasked 5 months ago
Extract only specific data from invoice using amazon textractasked 12 days ago
Why does Textract miss some data in PDF's?asked 3 months ago