Persistent data and time in reward_function

0

I want to identify events such as the car changing direction or crossing a threshold and calculate how frequently those events happen. This will require a history of values and some knowledge of time. Is there any way to get this data within the reward function? From the examples it seems like we only have the ability to know the cars instantaneous state.

averdow
demandé il y a 5 ans235 vues
2 réponses
0

This is not necessarily the answer you are looking for, but you can do this if you use the SageMaker Jupyter notebook method to train your model. In fact the code already does this, so you can just follow the patterns in there.

From what I can see this will not work in the DeepRacer console, but I have not tried very hard.

répondu il y a 5 ans
0

I suspected that could be the case, will give it a shot. Thanks!

averdow
répondu il y a 5 ans

Vous n'êtes pas connecté. Se connecter pour publier une réponse.

Une bonne réponse répond clairement à la question, contient des commentaires constructifs et encourage le développement professionnel de la personne qui pose la question.

Instructions pour répondre aux questions