PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Updated Jan 7, 2025 - Python
Repo for the Deep Reinforcement Learning Nanodegree program
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
OpenAI Gym environments for an open-source quadruped robot (SpotMicro)
A curated list of awesome model based RL resources (continually updated)
A simple, well-styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8
Really Fast End-to-End Jax RL Implementations
NMA deep learning course
A curated list of Decision Transformer resources (continually updated)
MDPs and POMDPs in Julia - An interface for defining, solving, and simulating fully and partially observable Markov decision processes on discrete and continuous spaces.
A curated list of Monte Carlo tree search papers with implementations.
A PyTorch library for building deep reinforcement learning agents.
Guided Policy Search
CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning
🤖 The Full Process Python Package for Robot Learning from Demonstration and Robot Manipulation
Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code
PyTorch C++ Reinforcement Learning
Code for paper "Computation Offloading Optimization for UAV-assisted Mobile Edge Computing: A Deep Deterministic Policy Gradient Approach"