+ [斯坦福 cs234 强化学习中文讲义](README.md) + [Lecture 1 Introduction to Reinforcement Learning](1.md) + [Lecture 3 Model Free Policy Evaluation: Policy Evaluation Without Knowing How the World Works](3.md) + [Lecture 4 Model Free Control](4.md) + [Lecture 5 Value Function Approximation](5.md) + [Lecture 6 CNNs and Deep Q-learning](6.md) + [Lecture 7 Imitation Learning](7.md) + [Lecture 8&9 Policy Gradient](8&9.md) + [Lecture 10 Advanced Policy Gradient](10.md) + [Lecture 11&12 Exploration and Exploitation](11&12.md) + [Lecture 14 Model Based RL, Monte-Carlo Tree Search](14.md)