How to improve accuracy of speaker diarization?

0

What steps can I take to improve AWS Transcribe's Speaker Diarization accuracy?

Unfortunately the algorithm is not doing a great job of correctly identifying who is talking, even with a clean audio recording. Its even having trouble distinguishing between a man and a woman's voice.

Much appreciated!

boogie
已提问 2 年前534 查看次数
1 回答
0

Other than a clean audio recording, I'd optimize the following factors:

  • When using custom vocabularies: keep the list small, and provide IPA pronunciations if you can.
  • When using real-time streams: two to five speakers seem best.
  • When using an audio source: set the "Maximum number of speaker" to the actual number of speakers in the file.

For sources and more information:
https://aws.amazon.com/transcribe/faqs/
https://docs.aws.amazon.com/transcribe/latest/dg/diarization.html

已回答 2 年前

您未登录。 登录 发布回答。

一个好的回答可以清楚地解答问题和提供建设性反馈,并能促进提问者的职业发展。

回答问题的准则