Transcribe: Understand/Save Speaker Detection

0

TL;DR I need to recognize the speaker identification (diarization) of the user's voice; couldn’t find a way to do it. I am building an application using AWS Transcribe streaming. I am able to get the speaker labels of each word, this is great; but, I need to somehow save and reuse the speaker label for further use with the same user.

eggAI
질문됨 2년 전317회 조회
1개 답변
0

Hi,

In addition to MaxSpeakerLabels, can you also set the ShowSpeakerLabels parameter to true value? Please check the speaker diarization page and the complete set of StartTranscriptionJob parameters. To go faster with the troubleshooting you can also try to test your audio files from the Amazon Transcribe console. Hope this helps.

Speaker diazization- https://docs.aws.amazon.com/transcribe/latest/dg/diarization.html

StartDescriptionJob- https://docs.aws.amazon.com/transcribe/latest/APIReference/API_StartTranscriptionJob.html

profile pictureAWS
지원 엔지니어
답변함 2년 전

로그인하지 않았습니다. 로그인해야 답변을 게시할 수 있습니다.

좋은 답변은 질문에 명확하게 답하고 건설적인 피드백을 제공하며 질문자의 전문적인 성장을 장려합니다.

질문 답변하기에 대한 가이드라인

관련 콘텐츠