
Pull requests: microsoft/DeepSpeed

Pull requests list

Cleanup ops/transformer/inference tests
#6925 opened Jan 3, 2025 by loadams
Autotp training
#6922 opened Jan 2, 2025 by inkcherry
fix: RuntimeError for UCP large DP
#6918 opened Dec 29, 2024 by saforem2
Add fp8_gemm fallback for non-triton systems
#6916 opened Dec 26, 2024 by oelayan7
Tecorigin sdaa accelerator
#6903 opened Dec 23, 2024 by siqi654321
Use ds-specific module id to avoid conflicts
#6847 opened Dec 10, 2024 by tjruwase
Support pure meta model lm_head tp
#6812 opened Dec 2, 2024 by Yejing-Lai
Check transformers version in BLOOM for inference v1
#6766 opened Nov 19, 2024 by lekurile
BLOOM fixes for DS Legacy Inference
#6765 opened Nov 19, 2024 by lekurile Draft
Fix building on Windows with presence of Triton
#6749 opened Nov 14, 2024 by woct0rdho
Support latest transformers with DSChat
#6711 opened Nov 4, 2024 by loadams
Update MII tests to support transformers latest
#6686 opened Oct 29, 2024 by loadams
modify_load_save_model
#6626 opened Oct 15, 2024 by ssklzx
Improve consistency of zero_grad
#6554 opened Sep 18, 2024 by tohtana Draft
Set shuffle=True by default in data_sampler
#6531 opened Sep 13, 2024 by ranzhejiang