Statistical Planning and Reinforcement Learning
Planning under uncertainty, Markov decision processes, reinforcement learning algorithms, policy optimization, and multi-agent systems.
MDPQ-LearningPolicy GradientReinforcement LearningPlanning
Overview
How agents learn to make sequential decisions. Covers MDPs, dynamic programming, Monte Carlo methods, temporal-difference learning (Q-learning, SARSA), policy gradient methods, and deep reinforcement learning.
Content Coming Soon
Tutorials, notebooks, and learning materials will be added as I progress through this topic.