Speaker partitioning overlap times

1

In the documentation for speaker partitioning, it mentions:

Utterances that overlap in the input audio don't overlap in the transcription output.

Does this mean that the transcribed text itself does not overlap? Or will the times never overlap?

For example, if Speaker A has times 1 -> 5 and Speaker B has times 3 -> 7, how will these times appear in the transcription output?

In tests, I have not been able to see these times represented accurately with overlap. Is it possible to change settings to allow for this?

Austin
已提问 10 个月前224 查看次数
1 回答
0

Hi,

The documentation says

If an utterance from one speaker overlaps with an utterance from another speaker, Amazon Transcribe 
Medical orders them in the transcription by their start times. Utterances that overlap in the input audio 
don't overlap in the transcription output.

So, the ones who starts speaking first by start time is transcribed first for his full utterance and the comes the ones who started second.

Best,

Didier

profile pictureAWS
专家
已回答 10 个月前
  • But in the example of "Speaker A has times 1 -> 5 and Speaker B has times 3 -> 7," would that not still match that description? The results we're seeing are instead Speaker A 1 -> 5 and Speaker B 5 -> 7 which is not accurate to the input audio. Just making sure we don't have an improper configuration.

您未登录。 登录 发布回答。

一个好的回答可以清楚地解答问题和提供建设性反馈,并能促进提问者的职业发展。

回答问题的准则