Skip to content

Releases: thakorneyp11/reinforcement-learning

v1-2023.16.0

23 Apr 21:11
7c51ee6
Compare
Choose a tag to compare

Overview

  • initial version on implementing reinforcement learning approaches from scratch

Current Features

Implemented Algorithms

  • Cross Entropy method
  • Monte Carlo: MC Prediction, MC Control
  • Temporal Difference: N-step SARSA, SARSAmax, Expected SARSA
  • Temporal Difference: Double Q-Learning
  • Deep Q Network (DQN)

Implemented Tools

  • Customized Gridworld environment
  • ONNX model conversion and usage script

Newly Added Features

  • Feat: add Monte Carlo algorithm and custom Gridworld environment (PR #1)
  • Feat: add Temporal Difference algorithm (PR #2)
  • Feat: add algorithm conversion to ONNX format (PR #3)
  • Feat: add Cross Entropy method on Cartpole environment (PR #4)
  • Feat: add Double Q-Learning on Taxi environment (PR #5)
  • Feat: add DQN algorithm on MountainCar environment (PR #7)