Text not recognized as "HANDWRITING" in Textract

0

Hi everyone,

I have an issue with a code that was working fine in production using Textract and which does not work anymore. When using Textract, it does not detect anymore that a line or a word is handwritten and not printed. In the output of textract.analyze_document(Document={'Bytes': file_content},FeatureTypes=["FORMS"]) (Python), when there is a "TextType" field, there is always a "PRINTED" value even though the word is obviously handwritten and that for any document i used as an input.

Does anyone have the same issue ? Is it a change with boto3 or Textract itself ?

Thanks in advance, Quentin

  • Hi everyone,

    I have the same issue. Using textractClient.startDocumentAnalysis(startDocumentAnalysisRequest) all words are set to PRINTED.

    But using StartExpenseAnalysisRequest the textType is set correctly. { "BlockType": "WORD", "Confidence": 35.711299896240234, "Text": "Untaborist", "TextType": "HANDWRITING", ... }

    Best regards

asked a year ago224 views
1 Answer
0

Thank you for using Textract. We would like to help with the issue you are facing. Can you create a case through AWS Support Center in your AWS account so that you can share some sample documents in which you are seeing issue if nature documents is not sensitive.

AWS
answered a year ago
  • As my company only have the Basic Support plan, we cannot contact the AWS Technical Support about this issue.

    Best regards, Quentin

  • Can you share image or document here?

You are not logged in. Log in to post an answer.

A good answer clearly answers the question and provides constructive feedback and encourages professional growth in the question asker.

Guidelines for Answering Questions