- Newest
- Most votes
- Most comments
I have never used Polly, but does any of this information help? https://docs.aws.amazon.com/polly/latest/dg/speechmarks.html
To prevent the issue where multi-word entities like "King Philip II" are categorized both as separate words and as a whole phrase in speech marks, you can use a few strategies. One approach is to filter out duplicates in the generated speech marks by identifying overlapping entries and keeping only the complete phrase.
Alternatively, applying Named Entity Recognition (NER) before generating the speech marks can help recognize multi-word entities and treat them as a single unit. Another option is to check the configuration of your speech mark generator to adjust how multi-word entities are handled directly.
Thanks! I could implement your suggestion of filtering out duplicates in the generated speech marks that overlap. I was curious if you could elaborate on this though - "check the configuration of your speech mark generator to adjust how multi-word entities are handled directly" - from what I see, AWS Polly does not have any specific configuration options for word tokens, is there something I'm missing?
Relevant content
- AWS OFFICIALUpdated 4 years ago

Thanks! My issue is specific to the phrase I'm using, so I have gone over the documentation with examples and special characters as well, but have not found an answer yet.