Persistent data and time in reward_function

0

I want to identify events such as the car changing direction or crossing a threshold and calculate how frequently those events happen. This will require a history of values and some knowledge of time. Is there any way to get this data within the reward function? From the examples it seems like we only have the ability to know the cars instantaneous state.

averdow
已提問 5 年前檢視次數 235 次
2 個答案
0

This is not necessarily the answer you are looking for, but you can do this if you use the SageMaker Jupyter notebook method to train your model. In fact the code already does this, so you can just follow the patterns in there.

From what I can see this will not work in the DeepRacer console, but I have not tried very hard.

已回答 5 年前
0

I suspected that could be the case, will give it a shot. Thanks!

averdow
已回答 5 年前

您尚未登入。 登入 去張貼答案。

一個好的回答可以清楚地回答問題並提供建設性的意見回饋,同時有助於提問者的專業成長。

回答問題指南