Transcribe: Understand/Save Speaker Detection

0

TL;DR I need to recognize the speaker identification (diarization) of the user's voice; couldn’t find a way to do it. I am building an application using AWS Transcribe streaming. I am able to get the speaker labels of each word, this is great; but, I need to somehow save and reuse the speaker label for further use with the same user.

eggAI
已提問 2 年前檢視次數 340 次
1 個回答
0

Hi,

In addition to MaxSpeakerLabels, can you also set the ShowSpeakerLabels parameter to true value? Please check the speaker diarization page and the complete set of StartTranscriptionJob parameters. To go faster with the troubleshooting you can also try to test your audio files from the Amazon Transcribe console. Hope this helps.

Speaker diazization- https://docs.aws.amazon.com/transcribe/latest/dg/diarization.html

StartDescriptionJob- https://docs.aws.amazon.com/transcribe/latest/APIReference/API_StartTranscriptionJob.html

profile pictureAWS
支援工程師
已回答 2 年前

您尚未登入。 登入 去張貼答案。

一個好的回答可以清楚地回答問題並提供建設性的意見回饋,同時有助於提問者的專業成長。

回答問題指南