Skip to content

Implementation of First return, then explore (Go-Explore)

License

Notifications You must be signed in to change notification settings

Hauf3n/GoExplore-Atari-PyTorch

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

GoExplore-Atari-PyTorch

Implementation of First return, then explore (Go-Explore) by Adrien Ecoffet, Joost Huizinga, Joel Lehman, Kenneth O. Stanley, Jeff Clune. The result is a neural network policy that reaches a score of 2500 on the Atari environment MontezumaRevenge.

Content

  1. Exploration Phase with demonstration generation
  2. Robustification Phase (PPO + SIL + Backward algorithm)

Results

  1. Exploration Phase demos with 2500 score
    1 2 4 5

  2. Robustification Phase with 2500 score
    9

  3. Robustification Phase backward algorithm progress
    10

  4. Robustification Phase path
    11

About

Implementation of First return, then explore (Go-Explore)

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages