Bayesian Uncertainty Driven Exploration

The project investigates the methodology of posterior sampling using Bayesian Networks for driving exploration in complex environments. Posterior sampling allows the agent to perform deep exploration of the environment by sampling different Q-value function for each episode. The requirement for such a strategy is to maintaina distribution over Q-value functions. The exploration in this strategy is driven by the variance of the posterior distribution that is sampled from each episode.

The Bayes-By-Backprop algorithm (Blundell etal., 2015) is employed to maintain a Bayesian Network that acts as the distribution over Q-value functions and is efficiently updated using Backpropagation algorithm.

Running

The 3 notebooks correspond to the 3 different environments i.e. Chain, CartPole and Pendulum. Simply running the notebooks should start training the network.

Tensorboard can be used to view the plots saved in results folder by

tensorboard --logdir results/Cartpole.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.ipynb_checkpoints		.ipynb_checkpoints
__pycache__		__pycache__
graphs		graphs
imgs		imgs
results		results
BayesianNetwork.py		BayesianNetwork.py
BayesianQNetwork.py		BayesianQNetwork.py
Chain_env.py		Chain_env.py
Distributions.py		Distributions.py
README.md		README.md
agent_train_cartpole.ipynb		agent_train_cartpole.ipynb
agent_train_chain.ipynb		agent_train_chain.ipynb
agent_train_pendulum.ipynb		agent_train_pendulum.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Bayesian Uncertainty Driven Exploration

Running

About

Releases

Packages

Languages

shishir13sharma/Bayesian-Uncertainty-Driven-Exploration

Folders and files

Latest commit

History

Repository files navigation

Bayesian Uncertainty Driven Exploration

Running

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages