Skip to content

Navigation Menu

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

Geonhee-LEE / rl-collision-avoidance Public

forked from Acmece/rl-collision-avoidance

Notifications You must be signed in to change notification settings
Fork 0
Star 1

Code
Issues 8
Pull requests
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Implement RL algorithms #5

Open

3 of 8 tasks

Geonhee-LEE opened this issue Aug 21, 2020 · 5 comments

Open

3 of 8 tasks

Implement RL algorithms #5

Geonhee-LEE opened this issue Aug 21, 2020 · 5 comments

Assignees

Comments

Copy link

Owner

Geonhee-LEE commented Aug 21, 2020 •

edited

Loading

Value based RL
- DQN
- Rainbow DQN
- CQL
Value based + Policy based RL
- DDPG
- TD3
Policy based RL
- PG
- TRPO
- PPO
Model based RL
- [ ]

The text was updated successfully, but these errors were encountered:

All reactions

Geonhee-LEE assigned Geonhee-LEE and CzJaewan

Copy link

Owner Author

Geonhee-LEE commented Aug 25, 2020

@CzJaewan 이전에 얘기했던 value based, policy based 분류는 https://spinningup.openai.com/en/latest/spinningup/rl_intro2.html#citations-below를 참조하면 좋을 듯.

0825 진행사항) 지금은 개념다시 잡으면서 Everett training 진행 중 and 전통적인 물체 회피 빌드 및 테스트 진행 중.

ps) 보고바람

All reactions

Sorry, something went wrong.

Copy link

Collaborator

CzJaewan commented Aug 27, 2020

@Geonhee-LEE value based RL, Policy based RL 등의 Algorithm들은 PPO를 통한 실제 테스트 까지 완료한 후 진행할 예정

이전 알고리즘 구현 및 테스트 진행 사항

Code 작성 완료 : DDPG, TD3, TRPO, SAC
Code Train 정상 작동 : DDPG
Code 수정 필요 : TD3, SAC
Code train test 필요 : TRPO

Geonhee-LEE reacted with thumbs up emoji

All reactions

👍 1 reaction

Sorry, something went wrong.

Copy link

Owner Author

Geonhee-LEE commented Sep 22, 2020

@CzJaewan 각각 소스의 참조 repo 좀 업데이트 부탁드립니다.

All reactions

Sorry, something went wrong.

Copy link

Collaborator

CzJaewan commented Sep 22, 2020 •

edited

Loading

DDPG : yanpanlau/DDPG-Keras-Torcs
TRPO : reinforcement-learning-kr/pg_travel
TD3 : https://github.com/nikhilbarhate99/TD3-PyTorch-BipedalWalker-v2, https://github.com/henry32144/TD3-Pytorch
SAC : https://github.com/pranz24/pytorch-soft-actor-critic/blob/master/sac.py

All reactions

Sorry, something went wrong.

Copy link

Owner Author

Geonhee-LEE commented Sep 24, 2020 •

edited

Loading

SAC 구현

https://github.com/Geonhee-LEE/rl-collision-avoidance/tree/sac

argparse 추가
참조한 SAC에 맞게 구조 변경
현재 mpi4python으로 사용하기 위해 shape 맞춰주는 중. 참조
- MPI for Python, mpi4py

All reactions

Sorry, something went wrong.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Assignees

Labels

None yet

Projects

None yet

Milestone

No milestone

Development

No branches or pull requests

2 participants

Footer

© 2025 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.