Transcription not recognizing multiple speakers


I am using AWS Transcribe to generate text transcriptions of audio files that usually have multiple speakers. The transcription jobs are started from a Python Lambda function using the Boto3 Transcribe client.

For two of the files I have processed in the past week, AWS Transcribe did not recognize the multiple speakers. Both files had two speakers, but the resulting transcript shows only one. When starting the transcription jobs, MaxSpeakerLabels was set to 2.

I can see how the voices in one of the files might sound similar, but in the other file they are very clearly different (female/male). Is there any other way to set the number of speakers on a transcription job, instead of just a maximum, to make sure that it recognizes 2 speakers?
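For reference, the job setup described in this question might look roughly like the sketch below. The job name, S3 URI, language code, and the helper names `build_job_request` and `start_job` are hypothetical placeholders, not values from the original post; only `ShowSpeakerLabels` and `MaxSpeakerLabels` come from the thread.

```python
from typing import Any, Dict


def build_job_request(job_name: str, media_uri: str, max_speakers: int = 2,
                      language_code: str = "en-US") -> Dict[str, Any]:
    """Build keyword arguments for StartTranscriptionJob with diarization on.

    All argument values are illustrative; language_code is an assumption.
    """
    # The API documents a minimum of 2 for MaxSpeakerLabels.
    if max_speakers < 2:
        raise ValueError("MaxSpeakerLabels must be at least 2")
    return {
        "TranscriptionJobName": job_name,
        "Media": {"MediaFileUri": media_uri},
        "LanguageCode": language_code,
        "Settings": {
            # MaxSpeakerLabels is only honored when ShowSpeakerLabels is true.
            "ShowSpeakerLabels": True,
            "MaxSpeakerLabels": max_speakers,
        },
    }


def start_job(request: Dict[str, Any]) -> Dict[str, Any]:
    """Submit the request; needs AWS credentials and the boto3 package."""
    import boto3  # imported lazily so the builder works without AWS access

    client = boto3.client("transcribe")
    return client.start_transcription_job(**request)
```

Keeping the request in a plain dictionary makes it easy to log exactly which settings a Lambda invocation sent, which helps when comparing a job that worked with one that did not.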

asked 2 years ago, 640 views
1 Answer

Hi. In addition to MaxSpeakerLabels, have you also set the ShowSpeakerLabels parameter to true? Please check the speaker diarization page and the complete set of StartTranscriptionJob parameters. To speed up troubleshooting, you can also test your audio files from the Amazon Transcribe console. Hope this helps.

AWS
Ioan
answered 2 years ago
  • Hi, yes, the ShowSpeakerLabels parameter was set to true on these jobs. I have checked that the appropriate parameters are all set, and the same configuration has worked on other files.
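One way to confirm how many speakers a finished job actually labeled is to inspect the `speaker_labels` section of the transcript JSON. The following is a minimal sketch, assuming the transcript has already been downloaded and parsed into a dictionary; the function name `summarize_speakers` and the sample data are hypothetical.

```python
from collections import defaultdict
from typing import Dict


def summarize_speakers(transcript: dict) -> Dict[str, float]:
    """Return total talk time in seconds for each speaker label.

    Expects the parsed transcript JSON of a job run with ShowSpeakerLabels.
    Returns an empty dict if the transcript has no speaker_labels section.
    """
    labels = transcript.get("results", {}).get("speaker_labels", {})
    talk_time: Dict[str, float] = defaultdict(float)
    for segment in labels.get("segments", []):
        # start_time and end_time are stored as strings in the output file.
        duration = float(segment["end_time"]) - float(segment["start_time"])
        talk_time[segment["speaker_label"]] += duration
    return dict(talk_time)


# Hypothetical sample shaped like a diarized transcript, trimmed to the
# fields this function reads.
SAMPLE = {
    "results": {
        "speaker_labels": {
            "speakers": 2,
            "segments": [
                {"speaker_label": "spk_0", "start_time": "0.0", "end_time": "2.5"},
                {"speaker_label": "spk_1", "start_time": "2.5", "end_time": "4.0"},
                {"speaker_label": "spk_0", "start_time": "4.0", "end_time": "5.0"},
            ],
        }
    }
}

if __name__ == "__main__":
    for speaker, seconds in sorted(summarize_speakers(SAMPLE).items()):
        print(f"{speaker}: {seconds:.1f}s")
```

If only one label such as `spk_0` appears for a file with two audible speakers, that shows the diarization itself merged the voices rather than the parameters being dropped on the way to the service.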
