How to improve accuracy of speaker diarization?

0

What steps can I take to improve AWS Transcribe's Speaker Diarization accuracy?

Unfortunately the algorithm is not doing a great job of correctly identifying who is talking, even with a clean audio recording. Its even having trouble distinguishing between a man and a woman's voice.

Much appreciated!

boogie
demandé il y a 2 ans534 vues
1 réponse
0

Other than a clean audio recording, I'd optimize the following factors:

  • When using custom vocabularies: keep the list small, and provide IPA pronunciations if you can.
  • When using real-time streams: two to five speakers seem best.
  • When using an audio source: set the "Maximum number of speaker" to the actual number of speakers in the file.

For sources and more information:
https://aws.amazon.com/transcribe/faqs/
https://docs.aws.amazon.com/transcribe/latest/dg/diarization.html

répondu il y a 2 ans

Vous n'êtes pas connecté. Se connecter pour publier une réponse.

Une bonne réponse répond clairement à la question, contient des commentaires constructifs et encourage le développement professionnel de la personne qui pose la question.

Instructions pour répondre aux questions