Similar Tracks
Bellman Equations, Dynamic Programming, Generalized Policy Iteration | Reinforcement Learning Part 2
Mutual Information
David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning | Lex Fridman Podcast #86
Lex Fridman
A friendly introduction to deep reinforcement learning, Q-networks and policy gradients
Serrano.Academy
Temporal Difference Learning (including Q-Learning) | Reinforcement Learning Part 4
Mutual Information