I reproduced your case, and have submitted a feedback entry on the console.
For now, I'd suggest translating the text without the HTML tags, which should give you a better result for your use case, as Marrick mentioned above.
Another alternative is running a grammar check after receiving the translated text.
Hey! It seems like you've stumbled on a fairly interesting bug. I've managed to reproduce the issue using the CLI, which shows that this is a bug in Translate itself, not just the Console:
Translate seems to handle other kinds of brackets fine, but angle brackets definitely trip it up a little bit. According to the Translate API documentation, strings with angle brackets should be supported. However, angle brackets are also used to specify do-not-translate tags, so this issue likely lies with the implementation of that feature.
Anyways, it sounds like you may be trying to translate HTML documents specifically? If so, you may find the example code for translating a web page useful: https://docs.aws.amazon.com/translate/latest/dg/examples-web.html. Instead of translating the raw HTML, it uses an HTML parser to only translate the text parts of the page while leaving the tags unchanged.
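To give a rough idea of that approach without following the link, here is a minimal sketch of my own (not the code from the AWS example) that uses Python's standard `html.parser` to pass only the text nodes to a translation function and re-emit the tags as they were written. The names `TextOnlyTranslator`, `translate_fragment` and `make_aws_translator` are made up for illustration, and the language codes are placeholders. The only AWS call involved is boto3's `translate_text`.

```python
import html
from html.parser import HTMLParser
from typing import Callable, List


class TextOnlyTranslator(HTMLParser):
    """Rebuilds an HTML fragment, sending only text nodes through `translate`."""

    def __init__(self, translate: Callable[[str], str]):
        super().__init__(convert_charrefs=True)
        self.translate = translate
        self.out: List[str] = []
        self._raw = False  # True while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        # Emit the tag exactly as written in the source.
        self.out.append(self.get_starttag_text())
        if tag in ("script", "style"):
            self._raw = True

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags such as <br/> are emitted once, unchanged.
        self.out.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        self.out.append(f"</{tag}>")
        if tag in ("script", "style"):
            self._raw = False

    def handle_comment(self, data):
        self.out.append(f"<!--{data}-->")

    def handle_data(self, data):
        # Leave script/style bodies and whitespace-only runs untouched.
        if self._raw or not data.strip():
            self.out.append(data)
            return
        # Keep surrounding whitespace so inline spacing survives.
        lead = data[: len(data) - len(data.lstrip())]
        trail = data[len(data.rstrip()):]
        translated = self.translate(data.strip())
        # Entities were decoded by the parser, so re-escape the result.
        self.out.append(lead + html.escape(translated, quote=False) + trail)


def translate_fragment(fragment: str, translate: Callable[[str], str]) -> str:
    parser = TextOnlyTranslator(translate)
    parser.feed(fragment)
    parser.close()  # flush any buffered trailing text
    return "".join(parser.out)


def make_aws_translator(source: str, target: str) -> Callable[[str], str]:
    """Hypothetical helper wrapping Amazon Translate (needs boto3 and credentials)."""
    import boto3  # imported lazily so the rest of the sketch runs without it

    client = boto3.client("translate")

    def _translate(text: str) -> str:
        resp = client.translate_text(
            Text=text, SourceLanguageCode=source, TargetLanguageCode=target
        )
        return resp["TranslatedText"]

    return _translate


if __name__ == "__main__":
    # Stand-in "translator" so the sketch can be tried offline.
    print(translate_fragment("<p>Hello <b>world</b></p>", str.upper))
```

For real use you would pass something like `make_aws_translator("en", "de")` instead of `str.upper`. Keep in mind that each text node is translated separately, so a sentence split by inline tags such as `<b>` loses context, and closing tags come back lowercased. Treat it as a starting point, not a drop-in fix.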
Hope that helps!
Marrick.
We considered the "translate whole pages" approach at the beginning and decided against it. Our requirement is that we translate CMS content exhaustively, not just some selected partial views of the content (i.e. "pages"). We crawl the CMS database and extract strings into a catalog, then upload it to localise.biz. Before switching to machine translation, we would just hand the localise.biz project to a human translator. We still want to keep the catalog on localise.biz for human intervention, but we now export XLIFF from there and batch-translate it. It used to work; the bug is new.
We need to translate large collections of HTML fragments. Nothing short of Amazon actually fixing the bug will work for us.
Any progress?