Speaker partitioning overlap times


The documentation for speaker partitioning states:

Utterances that overlap in the input audio don't overlap in the transcription output.

Does this mean that the transcribed text itself does not overlap? Or will the times never overlap?

For example, if Speaker A has times 1 -> 5 and Speaker B has times 3 -> 7, how will these times appear in the transcription output?

In tests, I have not been able to see these times represented accurately with overlap. Is it possible to change settings to allow for this?
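To make the question concrete, here is a minimal sketch of the check I have in mind. It assumes the segments have already been pulled out of the transcription output as `(speaker, start, end)` tuples; the helper name `find_overlaps` and the tuple layout are my own illustration, not part of any Amazon Transcribe API.

```python
from itertools import combinations


def find_overlaps(segments):
    """Return pairs of segments from different speakers whose time ranges intersect.

    Each segment is a (speaker, start, end) tuple. Ranges that only touch
    (one ends exactly where the other starts) are not counted as overlapping.
    """
    overlaps = []
    for a, b in combinations(segments, 2):
        spk_a, start_a, end_a = a
        spk_b, start_b, end_b = b
        # Two ranges intersect when each one starts before the other ends.
        if spk_a != spk_b and start_a < end_b and start_b < end_a:
            overlaps.append((a, b))
    return overlaps


# The times from the example: what was actually spoken in the input audio.
in_audio = [("A", 1.0, 5.0), ("B", 3.0, 7.0)]
print(find_overlaps(in_audio))
```

If the output preserved the real times, this check would report the A/B pair; if it returns an empty list, the times have been adjusted so that they no longer overlap.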

Austin
Asked 10 months ago, viewed 224 times
1 answer

Hi,

The documentation says:

If an utterance from one speaker overlaps with an utterance from another speaker, Amazon Transcribe Medical orders them in the transcription by their start times. Utterances that overlap in the input audio don't overlap in the transcription output.

So whoever starts speaking first is transcribed first, for their full utterance, and then comes the speaker who started second.
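As a rough illustration only, here is a toy model of that behaviour. It is an assumption about what "ordered by start time, without overlap" could look like, not Amazon Transcribe Medical's actual algorithm; the function name `order_without_overlap` and the `(speaker, start, end)` tuple layout are hypothetical.

```python
def order_without_overlap(utterances):
    """Toy model: sort utterances by start time, then push each start
    forward so that it never precedes the end of an earlier utterance.

    Each utterance is a (speaker, start, end) tuple. This is an
    illustrative assumption, not the service's documented behaviour.
    """
    ordered = sorted(utterances, key=lambda u: u[1])  # order by start time
    result = []
    previous_end = None
    for speaker, start, end in ordered:
        if previous_end is not None and start < previous_end:
            start = previous_end      # clip the overlapping part
            end = max(end, start)     # keep the range well-formed
        result.append((speaker, start, end))
        previous_end = end if previous_end is None else max(previous_end, end)
    return result


# Speaker A talks from 1 to 5, Speaker B from 3 to 7 (times from the question).
print(order_without_overlap([("B", 3, 7), ("A", 1, 5)]))
```

Under this assumed model the example from the question comes out as Speaker A 1 -> 5 followed by Speaker B 5 -> 7: the order follows the start times, but the overlapping portion of the second utterance's time range is not preserved.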

Best,

Didier

AWS
Expert
Answered 10 months ago
  • But in the example of "Speaker A has times 1 -> 5 and Speaker B has times 3 -> 7," would that not still match that description? The results we're seeing are instead Speaker A 1 -> 5 and Speaker B 5 -> 7 which is not accurate to the input audio. Just making sure we don't have an improper configuration.
