Are non-categorical fields used for training in Amazon Personalize with User Personalization recipe?

0

For example, if I have the following User schema:

{
	"name": "USER_ID",
	"type": "string"
},
{
	"name": "AGE",
	"type": [
	  "null",
	  "int"
	],
        "categorical": true
},
{
	"name": "GENDER",
	"type": [
          "null",
	  "string"
	],
},
{
        "name": "USER_DESCRIPTION",
        "type": "string"
}

Will the "GENDER" field be used for training or will only the "AGE" be used for training?

Also, what are the differences between contextual & categorial metadata?

Is this applicable for Users, Item, and User-Item interactions?

I am new to Amazon Personalize, so I apologize if this is trivial.

1 Risposta
0

Hi & thanks for asking!

As mentioned in the "Metadata fields" section of the Custom Datasets and Schemas developer guide page:

If you add your own metadata field of type string, it must include the categorical attribute or the textual attribute (only Items schemas support fields with the textual attribute). Otherwise, Amazon Personalize won't use the field when training a model.

...So you must explicitly specify string fields as either categorical or textual, otherwise Personalize will ignore them in model training (which can also be useful - if you'd like to include a field for filtering purpose but not have it affect the model). Since you're building a user metadata schema, not item - it'll need to be categorical.

However - I think you'll also need to remove the categorical flag from your "AGE" field there: since it's a numeric rather than string. You can also refer to this example user metadata schema.

AWS
ESPERTO
Alex_T
con risposta 2 anni fa
  • To expand on what Alex said, numerical fields are always used model training (for the recipes and recommenders that use that data). You can have the AGE field as categorical field if you want to define your own categories, for instance "child", "teen", "adult". You should do this if you have particular categories that make sense for your use-case rather than relying on Amazon Personalize to process the data. Non categorical/non numerical fields can be added to be used for filtering and numerical fields can be used for optimizing on an additional objective.

Accesso non effettuato. Accedi per postare una risposta.

Una buona risposta soddisfa chiaramente la domanda, fornisce un feedback costruttivo e incoraggia la crescita professionale del richiedente.

Linee guida per rispondere alle domande