What's the realtime factor for the Transcribe Streaming SDK?

0

Hi there!

We are using AWS Transcribe for streaming speech-to-text transcription. We're keen to understand whether the transcription is faster than realtime, and if so, what's the expected maximum factor?

As an example, if we buffer 5 seconds of audio data on the client before pushing into the stream for transcription, then send the buffer all at once, how long would we expect to wait before it catches up to real-time again? It seems, from experiments, that it's not faster than realtime (exactly 1x?), and transcription results stay at a 5 second delay as we push more audio data into it, but I wanted to verify that's the case!

Thank you very much, Ben

質問済み 2年前120ビュー
回答なし

ログインしていません。 ログイン 回答を投稿する。

優れた回答とは、質問に明確に答え、建設的なフィードバックを提供し、質問者の専門分野におけるスキルの向上を促すものです。

質問に答えるためのガイドライン

関連するコンテンツ