In the recent years, we have witnessed spectacular progress in applying techniques of reinforcement learning to problems that have for a long time considered to be out-of-reach -- be it the game of „Go“ or autonomous driving. This course is about Markov decision processes, which is the mathematical foundation of reinforcement learning. The style of the course will be two-fold. On the one hand, the lecture will provide rigorous definitions and proves for the most central motives in Markov decision processes. On the other hand, this theory will be illustrated by hands-on implementations reflecting the most recent developments in this fast-moving field.
Prof. Dr. Christian Hirsch
Lecture: Monday, 10.15-11.45, B6, A203, Tuesday, 10.15-11.45, A5, C012
There are additional lectures on
There are no lectures on
Dimitri P. Bertsekas (2005, 3rd edition) Dynamic Programming and Optimal Control, Vol. I
Tor Lattimore and Csaba Szepesvári (2019) Bandit Algorithms, https://tor-lattimore.com/downloads/book/book.pdf
Martin L. Puterman (2009, 2nd edition) Markov Decision Processes: Discrete Stochastic Dynamic Programming