Misc code associated with exercises in Sutton's 'Reinforcement Learning'

SteveDraper/rl_book

rl_book

Misc code associated with exercises in Sutton's 'Reinforcement Learning'. A suitable conda environment is provided in environment.yml

Chapter 1 & Chapter 2

This repo had not been started at this point, so there is no code for these chapters

TODO

Go back and implement at least a bandit framework

Chapter 3

Implemented an MDP framework with minimal dependencies and reusable abstractions for probability distributions. The only dependencies are numpy and matplotlib (the latter only to generate charts of the exercise results)
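
To give a feel for what a reusable distribution abstraction can look like, here is a minimal sketch using only numpy. It is not the code in this repo: the names `Discrete`, `expectation`, `sample` and `backup` are hypothetical and chosen only for illustration. It assumes a transition is modelled as a finite distribution over (next_state, reward) pairs.

```python
import numpy as np


class Discrete:
    """Hypothetical finite distribution over arbitrary outcomes (illustration only)."""

    def __init__(self, outcomes, probs):
        probs = np.asarray(probs, dtype=float)
        if len(outcomes) != len(probs):
            raise ValueError("outcomes and probs must have the same length")
        if np.any(probs < 0) or not np.isclose(probs.sum(), 1.0):
            raise ValueError("probabilities must be non-negative and sum to 1")
        self.outcomes = list(outcomes)
        self.probs = probs

    def expectation(self, f):
        """Expected value of f(outcome) under this distribution."""
        return float(sum(p * f(o) for o, p in zip(self.outcomes, self.probs)))

    def sample(self, rng):
        """Draw one outcome using a numpy random Generator."""
        return self.outcomes[rng.choice(len(self.outcomes), p=self.probs)]


def backup(transition, values, gamma):
    """Expected one-step return for a distribution over (next_state, reward) pairs."""
    return transition.expectation(lambda sr: sr[1] + gamma * values[sr[0]])


# Toy transition: an action leads to state 0 with reward 1, or to state 1 with reward 0.
step = Discrete([(0, 1.0), (1, 0.0)], [0.25, 0.75])
values = np.array([2.0, 4.0])  # assumed current state-value estimates
q = backup(step, values, gamma=0.5)
```

Keeping the expectation on the distribution object means the same `backup` function works unchanged for any transition model that can be expressed as a finite distribution.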

TODO

Try porting the framework to a probabilistic programming framework such as Pyro

Chapter 4

Added code for (4.5) and (4.9) using the MDP framework. The results are interesting with regard to policy stability when multiple optimal (or near-optimal?) policies exist
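
One common source of this kind of instability, which may or may not be what was observed here, is tie-breaking in the greedy improvement step: when two actions have equal or nearly equal value, a plain argmax can switch between them on rounding noise, so the policy never looks stable. The sketch below is an assumption-laden illustration rather than the repo's code. The function `improve` and the tolerance `tol` are hypothetical names, and it assumes action values are stored in an array of shape (n_states, n_actions).

```python
import numpy as np


def improve(q, old_policy, tol=1e-9):
    """Greedy improvement that keeps the old action when it is (near-)optimal.

    q: action values, shape (n_states, n_actions).
    old_policy: integer action index per state, shape (n_states,).
    Returns (new_policy, stable).
    """
    q = np.asarray(q, dtype=float)
    states = np.arange(q.shape[0])
    best = q.max(axis=1)
    greedy = q.argmax(axis=1)
    # Keep the old action wherever it is within tol of the best value.
    keep = q[states, old_policy] >= best - tol
    new_policy = np.where(keep, old_policy, greedy)
    stable = bool(np.array_equal(new_policy, old_policy))
    return new_policy, stable


q = np.array([[1.0, 1.0 + 1e-12],  # two actions tied up to rounding noise
              [0.0, 2.0]])         # action 1 is clearly better
old = np.array([0, 0])
new, stable = improve(q, old)       # state 0 keeps action 0, state 1 switches
again, stable_again = improve(q, new)
```

With a plain argmax, state 0 would switch to action 1 on a difference of 1e-12. With the tolerance, only state 1 changes and the second pass reports the policy as stable.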
