Lecture 8
This week we'll do some general reading on reinforcement learning. These readings are intended as an introduction to some of the main themes in RL. Feel free to suggest others or substitutes.
Policy Gradients:
Peters & Schaal, Reinforcement learning of motor skills with policy gradients Links to an external site.: Neural Networks, 2008
Slides Download Slides by Biye Jiang
Deep Q-Learning:
Volodymyr Mnih Koray Kavukcuoglu David Silver Alex Graves Ioannis Antonoglou, Daan Wierstra Martin Riedmiller, Playing Atari with Deep Reinforcement Learning Links to an external site. arXiv, 2013
Slides Download Slides by Vincenc Rubies Royo
Search (DAGGER):
Stéphane Ross Geoffrey J. Gordon J. Andrew Bagnell, A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning
Links to an external site., arXiv and AISTATS 2011.
Slides
Download Slides by Haoyu Chen