RL-PPO-Tensorflow

A TensorFlow implementation of Proximal Policy Optimization (PPO) algorithms.
The basic policy-gradient model is often hard to train, which is why enhanced versions such as PPO were developed.
If you are not familiar with the basic policy-gradient algorithm or have no experience training a basic policy-gradient model, I suggest looking at my project "Basic_Policy_Gradient" first.
This is an implementation of the basic PPO algorithm that plays the games "CartPole-v0" and "Pendulum-v0".
You can change the code to play other OpenAI Gym games. You can also optimize the algorithm.
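For readers coming from the basic policy-gradient project, here is a minimal sketch of the clipped surrogate objective described in the PPO paper. It is written in plain Python rather than TensorFlow so it runs without extra dependencies. The function name `ppo_clip_objective`, its arguments, and the default `clip_eps=0.2` are assumptions made for illustration; the scripts in this repository may compute the loss differently (for example with a KL penalty instead of clipping).

```python
import math


def ppo_clip_objective(new_log_probs, old_log_probs, advantages, clip_eps=0.2):
    """Mean clipped surrogate objective from the PPO paper (to be maximized).

    Hypothetical helper for illustration only, not code from this repository.
    new_log_probs / old_log_probs: log pi(a|s) under the current and the old policy.
    advantages: advantage estimates for the same (state, action) samples.
    """
    total = 0.0
    for new_lp, old_lp, adv in zip(new_log_probs, old_log_probs, advantages):
        # Probability ratio r = pi_new(a|s) / pi_old(a|s).
        ratio = math.exp(new_lp - old_lp)
        # Keep the ratio inside [1 - eps, 1 + eps].
        clipped_ratio = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Take the more pessimistic of the unclipped and clipped terms.
        total += min(ratio * adv, clipped_ratio * adv)
    return total / len(advantages)


# Two samples: one with ratio 1.5 and a positive advantage,
# one with ratio 0.5 and a negative advantage.
objective = ppo_clip_objective(
    new_log_probs=[math.log(1.5), math.log(0.5)],
    old_log_probs=[0.0, 0.0],
    advantages=[1.0, -1.0],
)
print(objective)
```

In a training script, the negative of this value would typically be minimized with a gradient-based optimizer. The clipping removes the incentive to move the new policy far from the old one within a single update, which is the main difference from the basic policy-gradient loss.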
If you want to exchange ideas with me, you can add me on WeChat: zggcdbs.

PPT
log
paper
README.md
new.py
simple-PPO_Pendulum.py
simple-PPO_cartRole.py

zhibindaxia/RL-PPO-Tensorflow
