Amazon Transcribe table format specifying IPA
I am processing a custom vocabulary table using the console at: https://console.aws.amazon.com/transcribe/
I've chosen a line from the example given
Phrase[TAB]SoundsLike[TAB]IPA[TAB]DisplayAs C.L.I.[TAB][TAB]sɪ ɛl aɪ[TAB]CLI`
TAB is replaced with actual tabs and line endings ending with LF
Once processed for an en-GB or en-US custom vocabulary, it fails with the error:
Validation error: File contains invalid characters or format in the IPA column. Error at line 2.
If I modify the file to include extra spaces it succeeds, as in
Phrase[TAB]SoundsLike[TAB]IPA[TAB]DisplayAs C.L.I.[TAB][TAB]s ɪ ɛ l aɪ[TAB]CLI
This error indicates that the examples given are incorrect or outdated. Is there another way to specify the syllables of a word by grouping characters together as in the examples?
This was an issue with the webpage examples. The examples have now been updated and now show spaces between all characters. So the issue I was initially seeing was intended behaviour, and spaces are required.
Custom language model not showing for real-time transcriptionAccepted Answerasked 5 months ago
Charges for CREATE TABLE AS SELECTAccepted Answerasked 4 months ago
How do I use a custom vocabulary when using Amazon Transcribe streaming?asked a year ago
Amazon Transcribe table format specifying IPAAccepted Answerasked 2 months ago
How to get the full list of words that Amazon Transcribe can recognize for a specific language?asked 2 months ago
Quicksight - Total line in Table to show Sum of Max values from each rowAccepted Answerasked 4 months ago
Amazon Transcribe Creating a custom vocabulary fail at line 2asked 4 months ago
inserting data from MQTT into dynamodbasked 13 days ago
Creating new custom vocabularies from a table is brokenasked 2 years ago
Can we add column to an existing table in AWS Athena using SQL query?Accepted Answerasked 3 years ago