Postgraduate course

Deep Reinforcement Learning

Course information, announcements, and lecture materials for students at Nanjing University.

Information

Instructor: Zhi Wang
Time: Thursday, 14:00–17:00
Location: A310 Feiyimin Building, Nanjing University at Gulou Campus

Announcements

Mar 5, 2026Teaching Assistant: Wenhao Wu (whao_wu@163.com)

Lecture Slides

Mar 5, 2026Lecture 1: Introduction to Reinforcement Learning · Notes
Mar 12, 2026Lecture 2: Preliminaries of Machine Learning
Mar 19, 2026Lecture 3: Dynamic Programming
Mar 10, 2026Lecture 4: Monte-Carlo Methods and Temporal-Difference Learning
Mar 17, 2026Lecture 5: Introduction to Deep Reinforcement Learning
Mar 24, 2026Lecture 6: Policy Gradient
Mar 31, 2026Lecture 7: Advanced Policy Gradient
Apr 7, 2026Lecture 8: Actor-Critic Algorithms
Apr 14, 2026Lecture 9: Value Function Methods
Apr 21, 2026Lecture 10: Deep Q-Learning
Apr 28, 2026Lecture 11: Multi-Agent Reinforcement Learning
May 5, 2026Lecture 12: Transfer Reinforcement Learning
May 12, 2026Lecture 13: Large Language Models and RL
May 19, 2026Lecture 14: Presentation — Part I
May 26, 2026Lecture 15: Presentation — Part II
Jun 2, 2026Lecture 16: Presentation — Part III

References

Richard S. Sutton and Andrew G. Barto, Reinforcement Learning: An Introduction, 2nd Edition
Sergey Levine, CS285: Deep Reinforcement Learning at UC Berkeley

For course questions, please contact the instructor or teaching assistant listed above.