1 réponse
- Le plus récent
- Le plus de votes
- La plupart des commentaires
1
Hello,
To extract the key components like title, author, and description from product descriptions in your PDF catalogs, Textract currently does not have built-in capabilities for that level of customization.
Machine learning models trained on sample catalog pages could help automatically classify the text into different fields. Services like Amazon SageMaker, AWS Glue, etc can help build such models.
répondu il y a un mois
Contenus pertinents
- demandé il y a un an
- Réponse acceptéedemandé il y a 7 mois
- demandé il y a 5 mois
- AWS OFFICIELA mis à jour il y a un an
- AWS OFFICIELA mis à jour il y a 2 ans
- AWS OFFICIELA mis à jour il y a 4 mois
You can develop a post-processing system that applies rules to classify text blocks based on layout patterns, or for a more sophisticated solution, train a custom machine learning model with Amazon SageMaker to recognize and categorize the text appropriately.