Tensorflow implementation of Proximal Policy Optimization Algorithms
Basic Policy-Gradient model is hard to trainning very likely,so the enhanced version has appeared.
If you don't know basic Policy-Gradient algorithm or have no experience about training Basic Policy-Gradient model,I suggest you look at my project:"Basic_Policy_Gradient" first.
This is an implementation of basic Proximal Policy Optimization Algorithm to play the game:"CartPole-v0" and "Pendulum-v0".
You can Change code to play other OpenAi Gym games. You can also Optimize this algorithm.
If you want to exchange ideas with me,you can add me to WeChat:zggcdbs.
forked from TangLaoDA/RL-PPO-Tensorflow
-
Notifications
You must be signed in to change notification settings - Fork 0
zhibindaxia/RL-PPO-Tensorflow
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Tensorflow implementation of Proximal Policy Optimization Algorithms
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published
Languages
- Python 100.0%