Understanding Language Support in AWS Personalize

Hello, we have recently decided to implement AWS Personalize for our e-commerce website. While going through the documentation, I noticed that only seven languages are listed as supported. I have a few questions regarding this:

When it mentions "support for language," does it imply that AWS Personalize has the capability to understand the content of the text? For example, would it relate products with different but synonymous descriptions together?

Or does it simply mean that only the text written in those supported languages is accepted? For instance, if I were to "Latinize" my data, would AWS Personalize still be able to establish relationships between products?

MJ2000
asked 10 months ago, 307 views
1 Answer
Accepted Answer

Language support in Amazon Personalize lets customers unlock the information trapped in their product descriptions, reviews, movie synopses, or other unstructured text to generate highly relevant recommendations for users. So, as you mentioned, it is about contextualizing the unstructured text in the dataset. You can see this in the reference documentation.

AWS
answered 10 months ago
Expert reviewed 10 months ago
  • Thank you for the clarification. To confirm, does this mean that AWS Personalize is capable of extracting meaning from the provided unstructured text data? Additionally, since we do not currently have plans to translate our data into English, would it be advisable to simply Latinize the data and publish it in that form?

  • Yes, it uses NLP behind the scenes to extract key elements from the metadata. Can you elaborate on what you mean by Latinizing?

  • Apologies for any confusion caused by my previous posts. This is what I mean by Latinizing:

    In my language, the word for "sky" is represented by the characters "ცა." By Latinizing, I am referring to the process of replacing each of these characters with their Latin counterparts. In this particular example, the Latinized version of "ცა" would be "Tsa."

  • If your data includes any non-ASCII characters, your CSV file must be encoded in UTF-8: https://docs.aws.amazon.com/personalize/latest/dg/data-prep-formatting.html

    So I don't think you need to Latinize your data.
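
To make the UTF-8 point concrete, here is a minimal Python sketch of writing item data with the original Georgian text left as-is. The file name, the sample rows, and the DESCRIPTION column name are assumptions for illustration only; ITEM_ID follows the Items dataset naming in the Personalize documentation, and your own schema decides the actual columns.

```python
import csv
import os
import tempfile

# Hypothetical item rows. DESCRIPTION is an assumed name for a free-text
# column; the first row keeps the original Georgian text, not a Latinized form.
rows = [
    {"ITEM_ID": "item-001", "DESCRIPTION": "ცა"},
    {"ITEM_ID": "item-002", "DESCRIPTION": "blue sky poster"},
]

# Write to a temporary directory so the sketch does not touch real data.
path = os.path.join(tempfile.mkdtemp(), "items.csv")

# encoding="utf-8" stores the non-ASCII characters correctly;
# newline="" is what the csv module expects for files it writes.
with open(path, "w", encoding="utf-8", newline="") as f:
    writer = csv.DictWriter(f, fieldnames=["ITEM_ID", "DESCRIPTION"])
    writer.writeheader()
    writer.writerows(rows)

# Read the file back with the same encoding to confirm the text round-trips.
with open(path, encoding="utf-8", newline="") as f:
    read_back = list(csv.DictReader(f))
```

If `read_back` matches `rows`, the file is stored as UTF-8 with the original characters intact, which is the encoding requirement the linked formatting page describes.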
