Mathematics in Reinforcement learning
Theoretical Course, Bilibili, 2025
This theoretical series is designed to walk you through the mathematical foundations that are central to modern RL algorithms. Throughout the course, we will cover key topics including unbiased versus biased estimations, the structure of Markov Decision Processes (MDPs), the mathematical definition and role of policies, value functions, and essential methods such as Monte Carlo sampling. Just come over and have fun
