rl_book

Misc code associated with exercises in Sutton's 'Reinforcement Learning'. Suitable environment provided for conda in envirnment.yml

Chapter 1 & Chapter 2

Hadn't started this repo at this point

TODO

Go back and implement at least a bandit framework

Chapter 3

Implemented MDP framework in a minimal-dependencies manner, with reusable abstractions for probability distributions. The only dependencies are numpy and matplotlib (the latter just to generate chart results for the exercises)

TODO

Should try porting to a probabilistic programming framework such as Pyro

Chapter 4

Added code for (4.5) and (4.9) using the MDP framework. Interesting results in regard to stability of policy where multiple optimal (or near optimal?) policies exist

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
python		python
.gitignore		.gitignore
Exercises.md		Exercises.md
README.md		README.md
environment.yml		environment.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

rl_book

Chapter 1 & Chapter 2

TODO

Chapter 3

TODO

Chapter 4

About

Releases

Packages

Contributors 2

Languages

SteveDraper/rl_book

Folders and files

Latest commit

History

Repository files navigation

rl_book

Chapter 1 & Chapter 2

TODO

Chapter 3

TODO

Chapter 4

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages