内容简介:今天详细阅读了Prioritized Experience Replay这篇论文,记录下心得体会。 Introduction online RL目前面临的2个问题: strongly correlated updates that break the i.i.d. assumption of many popular stochastic gradient-based algorithms....
用户评论
推荐服务