Doyoung’s Robot Diary

Doyoung Kim

I am a person of Reinforcement Learning

[Dynamic Programming] Policy Iteration

6 minute read

We will sequentially cover Dynamic Programming, Monte Carlo methods, and Temporal Difference methods. Sutton describes these methods in his book as follows:

Markov Decision Process

8 minute read

Markov Process (MP, Markov Chain)

State Value Function Law of Iterative Expectation

2 minute read

State Value Function and the Law of Iterative Expectation in Reinforcement Learning

Law of Iterative Expectation proof

1 minute read

Law of Iterative Expectation (LIE) - Proof

Generative Adversarial Networks(GAN) code review

4 minute read