Unsupervised pre-training:
Variational Option Discovery Algorithms (real hierarchical)
Diversity Is All You Need: SAC-based code is available.
Model-Ensemble Trust-Region Policy Optimization, Kurutach et al, 2018. Algorithm: ME-TRPO. Code is available.
Model-Based Reinforcement Learning via Meta-Policy Optimization, Clavera et al, 2018. Algorithm: MB-MPO.
EMI: Exploration with Mutual Information Maximizing State and Action Embeddings
polo exploration: Randomized Prior Functions for Deep Reinforcement Learning (related to RND)