Text not recognized as "HANDWRITING" in Textract

0

Hi everyone,

I have an issue with a code that was working fine in production using Textract and which does not work anymore. When using Textract, it does not detect anymore that a line or a word is handwritten and not printed. In the output of textract.analyze_document(Document={'Bytes': file_content},FeatureTypes=["FORMS"]) (Python), when there is a "TextType" field, there is always a "PRINTED" value even though the word is obviously handwritten and that for any document i used as an input.

Does anyone have the same issue ? Is it a change with boto3 or Textract itself ?

Thanks in advance, Quentin

  • Hi everyone,

    I have the same issue. Using textractClient.startDocumentAnalysis(startDocumentAnalysisRequest) all words are set to PRINTED.

    But using StartExpenseAnalysisRequest the textType is set correctly. { "BlockType": "WORD", "Confidence": 35.711299896240234, "Text": "Untaborist", "TextType": "HANDWRITING", ... }

    Best regards

feita há um ano237 visualizações
1 Resposta
0

Thank you for using Textract. We would like to help with the issue you are facing. Can you create a case through AWS Support Center in your AWS account so that you can share some sample documents in which you are seeing issue if nature documents is not sensitive.

AWS
respondido há um ano
  • As my company only have the Basic Support plan, we cannot contact the AWS Technical Support about this issue.

    Best regards, Quentin

  • Can you share image or document here?

Você não está conectado. Fazer login para postar uma resposta.

Uma boa resposta responde claramente à pergunta, dá feedback construtivo e incentiva o crescimento profissional de quem perguntou.

Diretrizes para responder a perguntas