Transcription not recognizing multiple speakers

0

I am using AWS Transcribe to generate text transcriptions of audio files that usually have multiple speakers. I am using a python lambda with Boto3 Transcribe to start the transcription jobs.

Two of the files that I have attempted to do this on in the past week have not had the multiple speakers recognized by AWS Transcribe. Both files had two speakers, but the text transcription coming out of AWS Transcribe only shows one speaker. When starting the transcription job, the MaxSpeakerLabels setting was set to 2.

I can see how in one of the files, the voices might sound similar, but in the other file, the voices are very clearly different (female/male). Is there any other way to set the number of speakers on a transcription job to make sure that it is recognizing 2 speakers? Instead of just a max number?

已提問 2 年前檢視次數 640 次
1 個回答
0

Hi, In addition to MaxSpeakerLabels, have you also set the ShowSpeakerLabels parameter to true value? Please check the speaker diarization page and the complete set of StartTranscriptionJob parameters. To go faster with the troubleshooting you can also try to test your audio files from the Amazon Transcribe console. Hope this helps.

AWS
Ioan
已回答 2 年前
  • Hi, yes the ShowSpeakerLabels parameter was set to true on these jobs. I have checked that the appropriate parameters are all set and this has worked on other files.

您尚未登入。 登入 去張貼答案。

一個好的回答可以清楚地回答問題並提供建設性的意見回饋,同時有助於提問者的專業成長。

回答問題指南