Instructor: Zhi Wang
Time: 14:00-17:00pm, Thursday
Location: A310 Feiyimin Building, Nanjing University at Gulou Campus
Mar. 5th, 2026   |   Teaching Assitant: Wenhao Wu (Email: whao_wu@163.com)
2026.03.05   |   Lecture 1: Introduction to Reinforcement Learning | notes
2026.03.12   |   Lecture 2: Preliminaries of Machine Learning
2026.03.19   |   Lecture 3: Dynamic Programming
2024.03.10   |   Lecture 4: Monte-Carlo Methods and Temporal-Difference Learning
2024.03.17   |   Lecture 5: Introduction to Deep Reinforcement Learning
2024.03.24   |   Lecture 6: Policy Gradient
2024.03.31   |   Lecture 7: Advanced Policy Gradient
2024.04.07   |   Lecture 8: Actor-Critic Algorithms
2024.04.14   |   Lecture 9: Value Function Methods
2024.04.21   |   Lecture 10: Deep Q-learning
2024.04.28   |   Lecture 11: Multi-Agent Reinforcement Learning
2024.05.05   |   Lecture 12: Transfer Reinforcement Learning
2024.05.12   |   Lecture 13: Large Language Models and RL
2024.05.19   |   Lecture 14: Presentation - Part I
2023.05.26   |   Lecture 15: Presentation - Part II
2023.06.02   |   Lecture 16: Presentation - Part III
  
Note: References for lecture slides
Richard S. Sutton and Andrew G. Barto, Reinforcement Learning: An Introduction, 2nd Edition