Fundamental of Reinforcement Learning
Preface
Cover
1. Introduction
2. Markov Decision Process
- MDP
- Value Function
3. Bellman Equation
- Bellman Expectation Equation
- Bellman Optimality Equation
4. Dynamic Programming
5. Monte-Carlo Learning
- Monte-Carlo Prediction
- Monte-Carlo Control
6. Temporal Difference Learning
7. Off-Policy Control
- Importance Sampling
- Q Learning
8. Value Function Approximation
9. DQN(Deep Q - Networks)
- Neural Network
- Deep Q Networks
10. Policy Gradient

Powered by GitBook

7. Off-Policy Control

Q learning

1. Importance Sampling

2. Q Learning

results matching ""

No results matching ""