Skip to content

jupyter31/PolicyGrad

Folders and files

NameName
Last commit message
Last commit date

Latest commit

d415736 · Jul 11, 2024

History

3 Commits
Jul 11, 2024
Jul 11, 2024
Jul 11, 2024
Jul 11, 2024
Jul 11, 2024
Jul 11, 2024
Jul 11, 2024
Jul 11, 2024

Repository files navigation

Policy Gradient

Experiment with Policy Gradient methods (description), as well as variance reduction.

Current implementation:

  • Continuous and discrete environments
  • Baseline network for variance reduction

Usage

Setup environment

$ conda env create -f [environment.yml | environment_cuda.yml]
$ conda activate [policy_grad | policy_grad_cuda]

Run training

$ python main.py --config_filename config_filename

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages