Text not recognized as "HANDWRITING" in Textract

Hi everyone,

I have an issue with code that was working fine in production with Textract and no longer works. Textract no longer detects that a line or a word is handwritten rather than printed. In the output of textract.analyze_document(Document={'Bytes': file_content}, FeatureTypes=["FORMS"]) (Python), whenever there is a "TextType" field, its value is always "PRINTED", even when the word is obviously handwritten. This happens for every document I have used as input.
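To make the symptom easier to reproduce, here is a minimal sketch that tallies the "TextType" values of the WORD blocks in a response. The helper names count_text_types and analyze_file are made up for illustration, and the sketch assumes the response is the plain dict returned by boto3 with a top-level "Blocks" list; WORD blocks without a "TextType" field are counted as "MISSING".

```python
from collections import Counter


def count_text_types(response, block_types=("WORD",)):
    """Tally the TextType values of the selected blocks in a Textract response dict."""
    counts = Counter()
    for block in response.get("Blocks", []):
        if block.get("BlockType") in block_types:
            # Blocks of the selected type that carry no TextType are counted as "MISSING".
            counts[block.get("TextType", "MISSING")] += 1
    return dict(counts)


def analyze_file(path):
    """Hypothetical wrapper around the call from the question (needs AWS credentials)."""
    import boto3  # imported lazily so count_text_types can be used without AWS access

    client = boto3.client("textract")
    with open(path, "rb") as f:
        file_content = f.read()
    return client.analyze_document(
        Document={"Bytes": file_content},
        FeatureTypes=["FORMS"],
    )
```

Calling count_text_types(analyze_file("scan.png")) on a document with handwriting (the file name is a placeholder) should show a "HANDWRITING" count if detection works; in the situation described above, only "PRINTED" would appear.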

Does anyone have the same issue? Is it caused by a change in boto3 or in Textract itself?

Thanks in advance, Quentin

  • Hi everyone,

    I have the same issue. Using textractClient.startDocumentAnalysis(startDocumentAnalysisRequest), all words are set to PRINTED.

    But with StartExpenseAnalysisRequest, the textType is set correctly: { "BlockType": "WORD", "Confidence": 35.711299896240234, "Text": "Untaborist", "TextType": "HANDWRITING", ... }

    Best regards

1 Answer

Thank you for using Textract. We would like to help with the issue you are facing. Could you create a case through the AWS Support Center in your AWS account, so that you can share some sample documents in which you see the issue, provided the documents are not sensitive?

AWS
answered a year ago
  • As my company only has the Basic Support plan, we cannot contact AWS Technical Support about this issue.

    Best regards, Quentin

  • Can you share the image or document here?
