Transcription not recognizing multiple speakers


I am using AWS Transcribe to generate text transcriptions of audio files that usually have multiple speakers. I start the transcription jobs from a Python Lambda function using the Boto3 Transcribe client.

For two of the files I processed in the past week, AWS Transcribe did not recognize the multiple speakers. Both files had two speakers, but the transcript that came back shows only one. When starting the transcription jobs, the MaxSpeakerLabels setting was set to 2.

I can see how the voices in one of the files might sound similar, but in the other file they are very clearly different (female/male). Is there any way to set the exact number of speakers on a transcription job, rather than just a maximum, to make sure that it recognizes 2 speakers?

asked 2 years ago, 580 views
1 Answer

Hi. In addition to MaxSpeakerLabels, have you also set the ShowSpeakerLabels parameter to true? Please check the speaker diarization page and the complete set of StartTranscriptionJob parameters. To speed up troubleshooting, you can also test your audio files from the Amazon Transcribe console. Hope this helps.
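
For reference, the two settings mentioned above can be sketched with Boto3 roughly as follows. This is a minimal sketch, not the asker's actual Lambda code: the helper names build_job_request and start_job, the job name, and the S3 URI are hypothetical. ShowSpeakerLabels and MaxSpeakerLabels are keys of the Settings parameter of StartTranscriptionJob. Note that MaxSpeakerLabels is an upper bound; as far as I know, the API does not offer a way to force an exact speaker count.

```python
def build_job_request(job_name, media_uri, max_speakers=2, language_code="en-US"):
    """Build keyword arguments for Transcribe's StartTranscriptionJob.

    job_name and media_uri are placeholders supplied by the caller.
    """
    return {
        "TranscriptionJobName": job_name,
        "Media": {"MediaFileUri": media_uri},
        "LanguageCode": language_code,
        "Settings": {
            # Speaker labels are only produced when this flag is True.
            "ShowSpeakerLabels": True,
            # Upper bound on the number of speakers, not an exact count.
            "MaxSpeakerLabels": max_speakers,
        },
    }


def start_job(request):
    """Submit the request; needs AWS credentials and the boto3 package."""
    import boto3  # imported lazily so build_job_request works offline

    client = boto3.client("transcribe")
    return client.start_transcription_job(**request)


# Hypothetical job name and S3 URI, for illustration only.
request = build_job_request("demo-two-speaker-job", "s3://example-bucket/call.mp3")
print(request["Settings"])
```

Keeping the request builder separate from the call makes it easy to log or unit test the exact parameters a Lambda sends before any job is started.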

AWS
Ioan
answered 2 years ago
  • Hi, yes, the ShowSpeakerLabels parameter was set to true on these jobs. I have checked that the appropriate parameters are all set, and this has worked on other files.
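
To see what the service actually returned for a problem file, it may help to inspect the speaker_labels section of the transcript JSON. The sketch below assumes the usual batch transcript layout (results.speaker_labels.segments, each with speaker_label, start_time, and end_time as strings). The function name speaker_talk_time and the SAMPLE data are made up for illustration and are not real job output.

```python
from collections import defaultdict


def speaker_talk_time(transcript):
    """Return {speaker_label: total seconds} from a parsed transcript dict."""
    labels = transcript.get("results", {}).get("speaker_labels", {})
    totals = defaultdict(float)
    for segment in labels.get("segments", []):
        # Times are stored as strings in the transcript JSON.
        duration = float(segment["end_time"]) - float(segment["start_time"])
        totals[segment["speaker_label"]] += duration
    return dict(totals)


# Hand-written sample in the assumed layout, not output from a real job.
SAMPLE = {
    "results": {
        "speaker_labels": {
            "speakers": 2,
            "segments": [
                {"speaker_label": "spk_0", "start_time": "0.0", "end_time": "2.5"},
                {"speaker_label": "spk_1", "start_time": "2.5", "end_time": "4.0"},
                {"speaker_label": "spk_0", "start_time": "4.0", "end_time": "6.0"},
            ],
        }
    }
}

for label, seconds in sorted(speaker_talk_time(SAMPLE).items()):
    print(label, seconds)
```

If only one label shows up, or the second label accounts for almost no time, that suggests the voices were merged during diarization rather than the parameters being ignored. An empty result suggests the transcript has no speaker_labels section at all, which would point back at the job settings.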
