1 個回答
- 最新
- 最多得票
- 最多評論
0
Other than a clean audio recording, I'd optimize the following factors:
- When using custom vocabularies: keep the list small, and provide IPA pronunciations if you can.
- When using real-time streams: two to five speakers seem best.
- When using an audio source: set the "Maximum number of speaker" to the actual number of speakers in the file.
For sources and more information:
https://aws.amazon.com/transcribe/faqs/
https://docs.aws.amazon.com/transcribe/latest/dg/diarization.html
已回答 2 年前
相關內容
- 已提問 1 年前
- AWS 官方已更新 2 年前