-
Notifications
You must be signed in to change notification settings - Fork 354
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Support CUDA Graph for MoE models (#1233)
* Align RNG tracker with megatron Signed-off-by: Robin Zhang <[email protected]> Co-authored-by: Yifei Song <[email protected]> * Fix module_params order and warmup bug in cudagraph Signed-off-by: Robin Zhang <[email protected]> Co-authored-by: Yifei Song <[email protected]> * Add fp8_group argument and fix fp8 accuracy issue for cudagraph Signed-off-by: Robin Zhang <[email protected]> Co-authored-by: Yifei Song <[email protected]> * Add TE modules and weights filters to support MoE models Signed-off-by: Robin Zhang <[email protected]> Co-authored-by: Yifei Song <[email protected]> * Revert self.fp8 Signed-off-by: Robin Zhang <[email protected]> * Use hooks to filter module params Signed-off-by: Robin Zhang <[email protected]> * Filter all TE modules in hooks Signed-off-by: Robin Zhang <[email protected]> Co-authored-by: Yifei Song <[email protected]> * Format code Signed-off-by: Robin Zhang <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update graph.py Signed-off-by: Xin Yao <[email protected]> * Revert CudaRNGStatesTracker Signed-off-by: Robin Zhang <[email protected]> * Format Update Signed-off-by: Yifei Song <[email protected]> * Revert "Use hooks to filter module params" This reverts commit 73a22e2. Signed-off-by: Yifei Song <[email protected]> * Remove filtering module params Signed-off-by: Robin Zhang <[email protected]> --------- Signed-off-by: Robin Zhang <[email protected]> Signed-off-by: Xin Yao <[email protected]> Signed-off-by: Yifei Song <[email protected]> Co-authored-by: Yifei Song <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Xin Yao <[email protected]> Co-authored-by: Xin Yao <[email protected]> Co-authored-by: Tim Moon <[email protected]>
- Loading branch information
1 parent
8952bc4
commit ae393e8
Showing
5 changed files
with
97 additions
and
17 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters