Skip to content

Actions: deepspeedai/DeepSpeed

nv-accelerate-v100

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
4,207 workflow run results
4,207 workflow run results

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

nv-accelerate-v100
nv-accelerate-v100 #12635: Scheduled
January 3, 2025 00:07 11m 1s master
January 3, 2025 00:07 11m 1s
Cleanup ops/transformer/inference tests
nv-accelerate-v100 #12633: Pull request #6830 synchronize by loadams
January 2, 2025 18:47 7m 19s loadams/transformers-inference
January 2, 2025 18:47 7m 19s
Autotp training
nv-accelerate-v100 #12631: Pull request #6922 synchronize by inkcherry
January 2, 2025 03:54 3m 56s inkcherry:autotp_training
January 2, 2025 03:54 3m 56s
nv-accelerate-v100
nv-accelerate-v100 #12630: Scheduled
January 2, 2025 00:07 3m 51s master
January 2, 2025 00:07 3m 51s
nv-accelerate-v100
nv-accelerate-v100 #12629: Scheduled
January 1, 2025 00:08 3m 54s master
January 1, 2025 00:08 3m 54s
Add fp8_gemm fallback for non-triton systems
nv-accelerate-v100 #12628: Pull request #6916 synchronize by oelayan7
December 31, 2024 12:01 11m 23s oelayan7:fp8_gemm_no_triton
December 31, 2024 12:01 11m 23s
[inf] Add config var to enable keeping module on host
nv-accelerate-v100 #12627: Pull request #6846 synchronize by oelayan7
December 31, 2024 07:32 3m 55s oelayan7:keep_module_on_host
December 31, 2024 07:32 3m 55s
nv-accelerate-v100
nv-accelerate-v100 #12626: Scheduled
December 31, 2024 00:07 12m 31s master
December 31, 2024 00:07 12m 31s
Use ds-specific module id to avoid conflicts
nv-accelerate-v100 #12625: Pull request #6847 synchronize by loadams
December 30, 2024 21:04 11m 9s olruwase/pr_6772
December 30, 2024 21:04 11m 9s
[BUG FIX]:fix get torch.version.cuda error when cuda is None in rocm
nv-accelerate-v100 #12624: Pull request #6909 synchronize by loadams
December 30, 2024 21:02 12m 48s hj-wei:dev_hjwei
December 30, 2024 21:02 12m 48s
Stage3: Use new torch grad accumulation hooks API
nv-accelerate-v100 #12623: Pull request #6773 synchronize by loadams
December 30, 2024 18:54 21m 1s deepcharm:stage3-use-new-grad-acc-api
December 30, 2024 18:54 21m 1s
Fix checkpointable_layers Logic
nv-accelerate-v100 #12622: Pull request #6881 synchronize by loadams
December 30, 2024 18:53 33m 40s Quentin-Anthony:qanthony/fix-act-recomp
December 30, 2024 18:53 33m 40s
Add fp8_gemm fallback for non-triton systems
nv-accelerate-v100 #12621: Pull request #6916 synchronize by loadams
December 30, 2024 17:57 1h 4m 34s oelayan7:fp8_gemm_no_triton
December 30, 2024 17:57 1h 4m 34s
fix: RuntimeError for UCP large DP
nv-accelerate-v100 #12620: Pull request #6918 synchronize by loadams
December 30, 2024 17:17 17m 21s saforem2/ucp-bug
December 30, 2024 17:17 17m 21s
nv-accelerate-v100
nv-accelerate-v100 #12617: Scheduled
December 30, 2024 00:07 3m 54s master
December 30, 2024 00:07 3m 54s
fix: RuntimeError for UCP large DP
nv-accelerate-v100 #12616: Pull request #6918 opened by saforem2
December 29, 2024 18:23 10m 58s saforem2/ucp-bug
December 29, 2024 18:23 10m 58s
nv-accelerate-v100
nv-accelerate-v100 #12615: Scheduled
December 29, 2024 00:08 3m 53s master
December 29, 2024 00:08 3m 53s
Use ds-specific module id to avoid conflicts
nv-accelerate-v100 #12614: Pull request #6847 synchronize by tjruwase
December 28, 2024 19:44 11m 30s olruwase/pr_6772
December 28, 2024 19:44 11m 30s
nv-accelerate-v100
nv-accelerate-v100 #12613: Scheduled
December 28, 2024 00:07 3m 54s master
December 28, 2024 00:07 3m 54s
[BUG FIX]:fix get torch.version.cuda error when cuda is None in rocm
nv-accelerate-v100 #12612: Pull request #6909 synchronize by hj-wei
December 27, 2024 03:06 11m 13s hj-wei:dev_hjwei
December 27, 2024 03:06 11m 13s
nv-accelerate-v100
nv-accelerate-v100 #12609: Scheduled
December 27, 2024 00:07 3m 56s master
December 27, 2024 00:07 3m 56s
Stage3: Use new torch grad accumulation hooks API
nv-accelerate-v100 #12608: Pull request #6773 synchronize by loadams
December 26, 2024 20:09 18m 24s deepcharm:stage3-use-new-grad-acc-api
December 26, 2024 20:09 18m 24s