gong qingfeng: decision theory / VPI – R&N 16.5 & 16.6; MDPs and RL – R&N 17.1-17.4; reinforcement learning – R&N 21.1-21.6 (2021-01-21)
Document worth reading: "A Survey of Inverse Reinforcement Learning: Challenges, Methods and Progress" (2021-01-14)
Reinforcement learning, chapter 3: An Example: Recycling Robot; Goals and Rewards; Returns and Episodes; Policies and Value Functions; Optimal Policies and Optimal Value Functions
Tree-RL for object localization: Reinforcement Learning Recall; Tree Structured Search; Experiments