CONTROL
RL for Prediction
RL for Control
Certainty Equivalence
TD for Control
Action Values
Learning Policies
Update Rule: Q-learning
Demo...
Next:
RL for Prediction
Up:
Reinforcement Learning (16)
Previous:
Temporal Difference Methods