Transcription not recognizing multiple speakers

0

I am using AWS Transcribe to generate text transcriptions of audio files that usually have multiple speakers. I am using a python lambda with Boto3 Transcribe to start the transcription jobs.

Two of the files that I have attempted to do this on in the past week have not had the multiple speakers recognized by AWS Transcribe. Both files had two speakers, but the text transcription coming out of AWS Transcribe only shows one speaker. When starting the transcription job, the MaxSpeakerLabels setting was set to 2.

I can see how in one of the files, the voices might sound similar, but in the other file, the voices are very clearly different (female/male). Is there any other way to set the number of speakers on a transcription job to make sure that it is recognizing 2 speakers? Instead of just a max number?

已提问 2 年前639 查看次数
1 回答
0

Hi, In addition to MaxSpeakerLabels, have you also set the ShowSpeakerLabels parameter to true value? Please check the speaker diarization page and the complete set of StartTranscriptionJob parameters. To go faster with the troubleshooting you can also try to test your audio files from the Amazon Transcribe console. Hope this helps.

AWS
Ioan
已回答 2 年前
  • Hi, yes the ShowSpeakerLabels parameter was set to true on these jobs. I have checked that the appropriate parameters are all set and this has worked on other files.

您未登录。 登录 发布回答。

一个好的回答可以清楚地解答问题和提供建设性反馈,并能促进提问者的职业发展。

回答问题的准则