Skip to content

Tensorflow implementation of Proximal Policy Optimization Algorithms

Notifications You must be signed in to change notification settings

zhibindaxia/RL-PPO-Tensorflow

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

RL-PPO-Tensorflow

Tensorflow implementation of Proximal Policy Optimization Algorithms
Basic Policy-Gradient model is hard to trainning very likely,so the enhanced version has appeared.
If you don't know basic Policy-Gradient algorithm or have no experience about training Basic Policy-Gradient model,I suggest you look at my project:"Basic_Policy_Gradient" first.
This is an implementation of basic Proximal Policy Optimization Algorithm to play the game:"CartPole-v0" and "Pendulum-v0".
You can Change code to play other OpenAi Gym games. You can also Optimize this algorithm.
If you want to exchange ideas with me,you can add me to WeChat:zggcdbs.

About

Tensorflow implementation of Proximal Policy Optimization Algorithms

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%