Actions: deepspeedai/DeepSpeed

nv-lightning-v100

4,348 workflow run results

Zero2: avoid graph breaks in torch.compile by using param_idx
nv-lightning-v100 #13831: Pull request #6803 synchronize by loadams
December 19, 2024 17:36 6m 23s nelyahu:zero2_param_idx
Cleanup ops/transformer/inference tests
nv-lightning-v100 #13829: Pull request #6830 synchronize by loadams
December 19, 2024 17:32 6m 5s loadams/transformers-inference
Cleanup ops/transformer/inference tests
nv-lightning-v100 #13828: Pull request #6830 synchronize by loadams
December 19, 2024 17:27 5m 10s loadams/transformers-inference
Cleanup ops/transformer/inference tests
nv-lightning-v100 #13827: Pull request #6830 synchronize by loadams
December 19, 2024 17:25 2m 42s loadams/transformers-inference
hpu_accelerator: use torch.use_deterministic_algorithms
nv-lightning-v100 #13824: Pull request #6897 opened by nelyahu
December 19, 2024 07:23 3m 11s nelyahu:patch-2
nv-lightning-v100
nv-lightning-v100 #13823: Scheduled
December 19, 2024 00:22 1h 47m 16s master
Allow to compile collective for PT > 2.3
nv-lightning-v100 #13822: Pull request #6674 reopened by loadams
December 18, 2024 21:53 2h 31m 55s nelyahu:compile_collectives
Allow to compile collective for PT > 2.3
nv-lightning-v100 #13821: Pull request #6674 synchronize by loadams
December 18, 2024 21:07 39m 26s nelyahu:compile_collectives
Copy #6674: Allow to compile collective for PT > 2.3
nv-lightning-v100 #13820: Pull request #6894 opened by loadams
December 18, 2024 21:01 3h 17m 48s loadams/test-compile-collectives
Fix checkpointable_layers Logic
nv-lightning-v100 #13819: Pull request #6881 synchronize by Quentin-Anthony
December 18, 2024 20:25 2h 2m 14s Quentin-Anthony:qanthony/fix-act-recomp
Support latest transformers with DSChat
nv-lightning-v100 #13817: Pull request #6711 synchronize by loadams
December 18, 2024 20:24 1h 59m 0s loadams/fix-ds-chat-transformers
Fix error caused by all_reduce call in domino
nv-lightning-v100 #13814: Pull request #6880 synchronize by hwchen2017
December 18, 2024 18:02 1h 20m 12s hongwei/fix_domino_allreduce
Stage3: Use new torch grad accumulation hooks API
nv-lightning-v100 #13813: Pull request #6773 synchronize by loadams
December 18, 2024 17:55 13m 15s deepcharm:stage3-use-new-grad-acc-api
Zero2: avoid graph breaks in torch.compile by using param_idx
nv-lightning-v100 #13812: Pull request #6803 synchronize by loadams
December 18, 2024 17:55 23m 32s nelyahu:zero2_param_idx
Update version.txt after 0.16.2 release
nv-lightning-v100 #13811: Pull request #6893 opened by loadams
December 18, 2024 17:52 10m 6s AutoPR/0.16.2
Inference ops unit test failures/fixes
nv-lightning-v100 #13808: Pull request #6879 synchronize by loadams
December 18, 2024 16:53 21m 0s loadams/inference-ops-test-repro
Stage3: Use new torch grad accumulation hooks API
nv-lightning-v100 #13807: Pull request #6773 synchronize by loadams
December 18, 2024 16:51 9m 42s deepcharm:stage3-use-new-grad-acc-api
Zero2: avoid graph breaks in torch.compile by using param_idx
nv-lightning-v100 #13806: Pull request #6803 synchronize by loadams
December 18, 2024 16:51 3m 11s nelyahu:zero2_param_idx
Update code owners
nv-lightning-v100 #13805: Pull request #6890 synchronize by loadams
December 18, 2024 16:30 3m 11s olruwase/code_owners
Use ds-specific module id to avoid conflicts
nv-lightning-v100 #13803: Pull request #6847 synchronize by tjruwase
December 18, 2024 13:59 3m 10s olruwase/pr_6772
Update code owners
nv-lightning-v100 #13802: Pull request #6890 opened by tjruwase
December 18, 2024 12:04 3m 11s olruwase/code_owners
Fix error caused by all_reduce call in domino
nv-lightning-v100 #13801: Pull request #6880 synchronize by tjruwase
December 18, 2024 11:51 3m 10s hongwei/fix_domino_allreduce