Comments: Reinforcement Learning: Importance Sampling and Prioritized Experience Replay

Summary: Today I read the Prioritized Experience Replay paper closely and am recording my notes here. Introduction. Online RL currently faces two problems: strongly correlated updates that break the i.i.d. assumption of many popular stochastic gradient-based algorithms....

User comments