Reinforcement Learning – Coding (Döring, Ferdinand)
This is a coding course in which algorithms from the previous semester's reinforcement learning lecture are implemented in Python. Topics include bandit algorithms (briefly), Q-learning, other temporal-difference (TD) algorithms, and policy gradient algorithms. The course is run jointly with PORSCHE; we will try to improve some of their reinforcement learning projects using PPO.