RL note (1): Why exploration is needed


[Sutton's book, section 2.2]

Why exploration is needed:

1. Greedy selection performs poorly in the long run: once the agent settles on an action that merely looks best under its current estimates, it often gets stuck choosing a suboptimal action and never discovers better ones.

2. When the true values of actions change over time, exploration is needed to check that none of the non-greedy actions has become better than the greedy one.

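3. The noisier the rewards (the larger their variance), the more exploration it takes to identify the best action, so exploratory methods gain a bigger advantage over greedy selection.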
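
The points above can be tried out with a small experiment. The following is a minimal sketch, assuming a 10-armed bandit with Gaussian rewards and sample-average value estimates, in the spirit of the testbed in Sutton's book. The names `run_bandit` and `average_over_tasks`, the parameter defaults, and the seeds are illustrative choices, not taken from the book.

```python
import random


def run_bandit(epsilon, steps=1000, k=10, reward_std=1.0, seed=0):
    """Play one k-armed bandit task; return the average reward per step.

    epsilon = 0.0 gives purely greedy selection; epsilon > 0 explores
    a random action with probability epsilon (assumed setup, see lead-in).
    """
    rng = random.Random(seed)
    # Hidden true action values; the agent never sees these directly.
    true_values = [rng.gauss(0.0, 1.0) for _ in range(k)]
    estimates = [0.0] * k  # sample-average estimate of each action's value
    counts = [0] * k       # how often each action has been taken
    total = 0.0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick any action uniformly at random.
            action = rng.randrange(k)
        else:
            # Exploit: pick the action with the highest estimate,
            # breaking ties at random.
            best = max(estimates)
            action = rng.choice([a for a in range(k) if estimates[a] == best])

        # Noisy reward centred on the true value of the chosen action.
        reward = rng.gauss(true_values[action], reward_std)

        # Incremental update of the sample average.
        counts[action] += 1
        estimates[action] += (reward - estimates[action]) / counts[action]
        total += reward

    return total / steps


def average_over_tasks(epsilon, runs=200, **kwargs):
    """Average the per-step reward over several randomly generated tasks."""
    return sum(run_bandit(epsilon, seed=s, **kwargs) for s in range(runs)) / runs


if __name__ == "__main__":
    print("greedy        :", average_over_tasks(0.0))
    print("epsilon = 0.1 :", average_over_tasks(0.1))
```

Because each seed generates the same true action values for both settings, the greedy and exploring agents are compared on the same tasks. Passing a larger `reward_std` (for example `average_over_tasks(0.1, reward_std=3.0)`) is one way to probe point 3.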

