Statistical Planning and Reinforcement Learning

Planning under uncertainty, Markov decision processes, reinforcement learning algorithms, policy optimization, and multi-agent systems.

MDPQ-LearningPolicy GradientReinforcement LearningPlanning

Overview

How agents learn to make sequential decisions. Covers MDPs, dynamic programming, Monte Carlo methods, temporal-difference learning (Q-learning, SARSA), policy gradient methods, and deep reinforcement learning.

Content Coming Soon

Tutorials, notebooks, and learning materials will be added as I progress through this topic.