Skip to content
Thakorn Swaengkit edited this page Jun 22, 2023 · 93 revisions

Algorithms Implemented

note: most algorithm explanation is included in their PRs

Algorithm Variants Implemented PR
1. Dynamic Programming
Value iteration -
Policy iteration -
2. Cross Entropy method
Cross Entropy method on CartPole environment cross_entropy/cross_entropy_cartpole.ipynb #4
3. Monte Carlo
MC Prediction and Control on custom Gridworld environment monte_carlo/mc_prediction.ipynb
monte_carlo/mc_control.ipynb
PR #1
MC Control on FrozenLake environment monte_carlo/mc_control_frozenlake.ipynb PR #1

4. Temporal Difference

5. Function Approximation

  • Function Approximation on MountainCar environment: ref github

6. Deep Q-Networks

  • (PR #7) Feat: add DQN algorithm on MountainCar environment
  • Deep Q-Network (DQN) on MountainCar environment: dqn/dqn_mountaincar.ipynb
  • N-step Deep Q-Network
  • Categorical Deep Q-Network
  • Double Deep Q-Network
  • Dueling Deep Q-Network
  • Dueling Double Deep Q-Network

7. Policy Gradient

8. Actor-Critic (AC)

9. Deep Deterministic Policy Gradient (DDPG)

X. Other Techniques

Advanced Algorithms

  • Hierarchical Reinforcement Learning
  • Multi-Agent Reinforcement Learning

Tools

1. Custom Gridworld environment

2. ONNX model conversion and usage

Clone this wiki locally