Bellman Equations, Dynamic Programming, Generalized Policy Iteration | Reinforcement Learning Part 2

Similar Tracks
Lecture 17 - MDPs & Value/Policy Iteration | Stanford CS229: Machine Learning Andrew Ng (Autumn2018)
Stanford Online
Model Based Reinforcement Learning: Policy Iteration, Value Iteration, and Dynamic Programming
Steve Brunton