1 Answer
Your use case sounds very close to the core Textract workflow. Check out this AWS ML blog post, which provides a solution that generates searchable PDFs, including bounding boxes for the text.
answered 2 years ago
I believe it could be possible, but have you tried Amazon Textract? It is purpose-built for text extraction and provides bounding box coordinates for the text it detects. You can then process the response any way you like to extract text, paragraphs, forms, etc. You can get started here: https://aws.amazon.com/textract/.
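To give a rough idea of what "processing the response" can look like, here is a minimal sketch in Python. It assumes you call the synchronous `detect_document_text` operation through boto3 and that you already know the page size in pixels. The helper names `extract_lines` and `detect_text` are made up for this example and are not part of any AWS library. Textract reports bounding boxes as ratios of the page width and height, so the sketch scales them to pixels.

```python
from typing import Any, Dict, List


def extract_lines(response: Dict[str, Any], page_width: int, page_height: int) -> List[Dict[str, Any]]:
    """Collect LINE blocks from a Textract-style response with pixel bounding boxes.

    Hypothetical helper: it only relies on the documented "Blocks" list and the
    "Geometry" -> "BoundingBox" fields (Left, Top, Width, Height as page ratios).
    """
    lines = []
    for block in response.get("Blocks", []):
        # Skip PAGE and WORD blocks; keep whole lines of text only.
        if block.get("BlockType") != "LINE":
            continue
        box = block["Geometry"]["BoundingBox"]
        lines.append({
            "text": block.get("Text", ""),
            "left": round(box["Left"] * page_width),
            "top": round(box["Top"] * page_height),
            "width": round(box["Width"] * page_width),
            "height": round(box["Height"] * page_height),
        })
    return lines


def detect_text(image_bytes: bytes) -> Dict[str, Any]:
    """Send document bytes to Textract (needs boto3 and AWS credentials)."""
    import boto3  # imported here so extract_lines works without boto3 installed

    client = boto3.client("textract")
    return client.detect_document_text(Document={"Bytes": image_bytes})
```

With a real document you would pass the result of `detect_text(...)` into `extract_lines(...)` together with the page dimensions, then group or filter the lines however your use case needs.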