Persistent data and time in reward_function

0

I want to identify events such as the car changing direction or crossing a threshold and calculate how frequently those events happen. This will require a history of values and some knowledge of time. Is there any way to get this data within the reward function? From the examples it seems like we only have the ability to know the cars instantaneous state.

averdow
已提问 5 年前235 查看次数
2 回答
0

This is not necessarily the answer you are looking for, but you can do this if you use the SageMaker Jupyter notebook method to train your model. In fact the code already does this, so you can just follow the patterns in there.

From what I can see this will not work in the DeepRacer console, but I have not tried very hard.

已回答 5 年前
0

I suspected that could be the case, will give it a shot. Thanks!

averdow
已回答 5 年前

您未登录。 登录 发布回答。

一个好的回答可以清楚地解答问题和提供建设性反馈,并能促进提问者的职业发展。

回答问题的准则

相关内容