Add maskable GraphPPO based on sb3_contrib.MaskablePPO + GNN for domains with graph observations #444

Merged: 1 commit, Jan 10, 2025

@nhuet (Contributor) commented Nov 29, 2024

  • Derive MaskableGraphPPO from MaskablePPO, and also from GraphOnPolicyAlgorithm to reuse the work done for GraphPPO.
  • Derive MaskableGNNActorCriticPolicy from MaskableActorCriticPolicy, and also from _BaseGNNActorCriticPolicy to share code with GNNActorCriticPolicy. Likewise for MaskableMultiInputGNNActorCriticPolicy.
  • Add a new use_action_masking argument to the scikit-decide stable_baselines3 wrapper, so that the domain is wrapped in an environment exposing an action_masks method (which redirects to domain.get_action_mask()).
  • Add examples for GraphMaze and GraphJspDomain so that the solver proposes only applicable actions.
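
The wrapping described in the third bullet can be illustrated with a minimal, library-free sketch. It is not the code from this PR: `ToyCorridorDomain`, `MaskedEnvSketch`, and `best_applicable_action` are hypothetical names invented for illustration, and the toy domain only mimics the `get_action_mask()` call mentioned above. The only names taken from the description are `action_masks` and `get_action_mask`.

```python
from typing import List, Sequence


class ToyCorridorDomain:
    """Hypothetical stand-in for a domain (NOT the scikit-decide API)."""

    N_CELLS = 3  # positions 0, 1, 2

    def __init__(self) -> None:
        self.position = 0

    def get_action_mask(self) -> List[bool]:
        # Actions: 0 = move left, 1 = move right, 2 = stay.
        return [self.position > 0, self.position < self.N_CELLS - 1, True]

    def apply(self, action: int) -> None:
        # Reject actions that the current mask marks as not applicable.
        if not self.get_action_mask()[action]:
            raise ValueError(f"action {action} is not applicable")
        self.position += {0: -1, 1: 1, 2: 0}[action]


class MaskedEnvSketch:
    """Hypothetical wrapper exposing action_masks(), forwarding to the domain."""

    def __init__(self, domain: ToyCorridorDomain) -> None:
        self.domain = domain

    def action_masks(self) -> List[bool]:
        return self.domain.get_action_mask()


def best_applicable_action(scores: Sequence[float], mask: Sequence[bool]) -> int:
    """Return the highest-scoring action index among those the mask allows."""
    allowed = [i for i, ok in enumerate(mask) if ok]
    if not allowed:
        raise ValueError("no applicable action")
    return max(allowed, key=lambda i: scores[i])


env = MaskedEnvSketch(ToyCorridorDomain())
scores = [5.0, 1.0, 0.5]  # "left" scores highest but is masked at position 0
action = best_applicable_action(scores, env.action_masks())
env.domain.apply(action)
```

In this sketch the mask is applied by filtering candidate actions before picking the best one; a maskable policy achieves a similar effect on its action distribution, so that only applicable actions are ever proposed.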

@nhuet force-pushed the gnn-sb3-maskable branch 2 times, most recently from f52083e to 9550a07 (December 5, 2024 09:20)
@nhuet force-pushed the gnn-sb3-maskable branch 2 times, most recently from b126b46 to 30b9f91 (December 17, 2024 08:55)
@nhuet marked this pull request as ready for review (December 17, 2024 08:55)
@nhuet force-pushed the gnn-sb3-maskable branch from 3bb3e6d to 7c7e89c (January 8, 2025 16:08)
@fteicht merged commit dcf2120 into airbus:master (Jan 10, 2025)
33 checks passed
@nhuet deleted the gnn-sb3-maskable branch (January 20, 2025 09:48)