1
2
3
4
Liu, Yuxi (Hayden)
2019
Table of Contents:
“... Decision Processes and Dynamic Programming -- 3. Monte Carlo Methods for Making Numerical Estimations -- 4. Temporal Difference and Q-Learning -- 5. Solving Multi-armed Bandit Problems -- 6. Scaling Up Learning with Function Approximation -- 7. Deep Q-Networks...”2019
Full text
5
6
7
8
9
10
11
12
“...MWG - Simulation, Monte Carlo method, random numbers...”