Speaker partitioning overlap times

In the documentation for speaker partitioning, it mentions:

Utterances that overlap in the input audio don't overlap in the transcription output.

Does this mean that the transcribed text itself does not overlap? Or will the times never overlap?

For example, if Speaker A has times 1 -> 5 and Speaker B has times 3 -> 7, how will these times appear in the transcription output?

In my tests, the output never shows overlapping times, so the timestamps do not match the input audio. Is there a setting that allows overlapping times in the output?
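For reference, here is a minimal sketch of the kind of check that could be run on the output to see whether any speaker segments overlap. It assumes each segment is a dict with `speaker_label`, `start_time`, and `end_time` string fields, which is how I understand the `speaker_labels` segments in the transcript JSON to look; the helper name `find_overlaps` and the sample values are hypothetical and taken from the example above, not from a real transcript.

```python
from typing import Dict, List, Tuple


def find_overlaps(segments: List[Dict[str, str]]) -> List[Tuple[str, str]]:
    """Return speaker-label pairs for consecutive segments that overlap in time.

    Assumes each segment has string fields "speaker_label", "start_time"
    and "end_time" (times in seconds). This field layout is an assumption.
    """
    # Order by start time so only neighbouring segments need comparing.
    ordered = sorted(segments, key=lambda s: float(s["start_time"]))
    overlaps = []
    for prev, cur in zip(ordered, ordered[1:]):
        # An overlap exists when the next segment starts before the previous one ends.
        if float(cur["start_time"]) < float(prev["end_time"]):
            overlaps.append((prev["speaker_label"], cur["speaker_label"]))
    return overlaps


# Times as they occur in the input audio (Speaker A 1 -> 5, Speaker B 3 -> 7).
input_like = [
    {"speaker_label": "spk_0", "start_time": "1.0", "end_time": "5.0"},
    {"speaker_label": "spk_1", "start_time": "3.0", "end_time": "7.0"},
]

# Times with no overlap (Speaker A 1 -> 5, Speaker B 5 -> 7).
non_overlapping = [
    {"speaker_label": "spk_0", "start_time": "1.0", "end_time": "5.0"},
    {"speaker_label": "spk_1", "start_time": "5.0", "end_time": "7.0"},
]

print(find_overlaps(input_like))       # [('spk_0', 'spk_1')]
print(find_overlaps(non_overlapping))  # []
```

The check only compares neighbouring segments after sorting, which is enough to tell whether the output contains any overlap at all.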

Austin
asked 9 months ago, 217 views
1 Answer
Hi,

The documentation says:

If an utterance from one speaker overlaps with an utterance from another speaker, Amazon Transcribe 
Medical orders them in the transcription by their start times. Utterances that overlap in the input audio 
don't overlap in the transcription output.

So the speaker who starts first, by start time, is transcribed first for their full utterance, and the speaker who started second comes after.

Best,

Didier

AWS EXPERT
answered 9 months ago
  • But in the example of "Speaker A has times 1 -> 5 and Speaker B has times 3 -> 7," would that not still match that description? The results we're seeing are instead Speaker A 1 -> 5 and Speaker B 5 -> 7 which is not accurate to the input audio. Just making sure we don't have an improper configuration.
