Releases: thakorneyp11/reinforcement-learning
v1-2023.16.0
Overview
- Initial release: reinforcement learning algorithms implemented from scratch
Current Features
Implemented Algorithms
- Cross Entropy method
- Monte Carlo: MC Prediction, MC Control
- Temporal Difference: N-step SARSA, SARSAmax (Q-Learning), Expected SARSA
- Temporal Difference: Double Q-Learning
- Deep Q Network (DQN)
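The tabular Temporal Difference variants listed above differ mainly in how they build the bootstrapped target. The following is a minimal sketch of those targets, not the repository's code: the function names, argument order, and the epsilon-greedy policy used for Expected SARSA are assumptions made for illustration.

```python
def epsilon_greedy_probs(q_values, epsilon):
    """Action probabilities of an epsilon-greedy policy over q_values."""
    n = len(q_values)
    probs = [epsilon / n] * n
    best = max(range(n), key=lambda a: q_values[a])
    probs[best] += 1.0 - epsilon
    return probs


def sarsamax_target(reward, next_q, gamma):
    """SARSAmax (Q-Learning): bootstrap from the greedy next action."""
    return reward + gamma * max(next_q)


def expected_sarsa_target(reward, next_q, gamma, epsilon):
    """Expected SARSA: bootstrap from the policy-weighted average of next values."""
    probs = epsilon_greedy_probs(next_q, epsilon)
    return reward + gamma * sum(p * q for p, q in zip(probs, next_q))


def double_q_target(reward, next_q_a, next_q_b, gamma):
    """Double Q-Learning: table A selects the action, table B evaluates it."""
    best = max(range(len(next_q_a)), key=lambda a: next_q_a[a])
    return reward + gamma * next_q_b[best]


def td_update(q_sa, target, alpha):
    """Move the current estimate a fraction alpha toward the target."""
    return q_sa + alpha * (target - q_sa)
```

All three targets feed the same `td_update` step. Double Q-Learning would typically pick which table to update (and which to evaluate with) at random on each step; that bookkeeping is omitted here.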
Implemented Tools
- Customized Gridworld environment
- ONNX model conversion and usage script
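For readers unfamiliar with gridworld environments, the sketch below shows the general shape of one. It is purely illustrative and is not the repository's customized Gridworld: the class name `GridWorld`, the `reset`/`step` interface, the action encoding, and the reward scheme (-1 per step, 0 on reaching the goal) are all assumptions.

```python
class GridWorld:
    """Hypothetical grid environment: start at (0, 0), walk to a goal cell."""

    # Assumed action encoding: 0 = up, 1 = down, 2 = left, 3 = right
    ACTIONS = {0: (-1, 0), 1: (1, 0), 2: (0, -1), 3: (0, 1)}

    def __init__(self, rows=4, cols=4, goal=None):
        self.rows, self.cols = rows, cols
        # Default goal is the bottom-right corner
        self.goal = goal if goal is not None else (rows - 1, cols - 1)
        self.state = (0, 0)

    def reset(self):
        """Return the agent to the start cell and report that state."""
        self.state = (0, 0)
        return self.state

    def step(self, action):
        """Apply an action; moves that would leave the grid keep the agent in bounds."""
        dr, dc = self.ACTIONS[action]
        r = min(max(self.state[0] + dr, 0), self.rows - 1)
        c = min(max(self.state[1] + dc, 0), self.cols - 1)
        self.state = (r, c)
        done = self.state == self.goal
        reward = 0.0 if done else -1.0
        return self.state, reward, done
```

Returning `(state, reward, done)` from `step` keeps the sketch small; an environment meant to plug into Gym-style tooling would follow that library's interface instead.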
Newly Added Features
- Feat: add Monte Carlo algorithm and custom Gridworld environment (PR #1)
- Feat: add Temporal Difference algorithm (PR #2)
- Feat: add algorithm conversion to ONNX format (PR #3)
- Feat: add Cross Entropy method on CartPole environment (PR #4)
- Feat: add Double Q-Learning on Taxi environment (PR #5)
- Feat: add DQN algorithm on MountainCar environment (PR #7)