Skip to content

Pull requests: microsoft/DeepSpeed

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Use accelerator to replace cuda in setup and runner
#5769 opened Jul 15, 2024 by Andy666G Loading…
Add fp8-fused gemm kernel
#5764 opened Jul 11, 2024 by sfc-gh-reyazda Loading…
Misplaced global variable warned
#5725 opened Jul 4, 2024 by anferico Loading…
Find ROCm on Fedora
#5705 opened Jun 28, 2024 by trixirt Loading…
sequence parallel with communication overlap
#5691 opened Jun 21, 2024 by inkcherry Loading…
Add and Remove ZeRO 3 Hooks
#5658 opened Jun 13, 2024 by jomayeri Loading…
Unpin transformers version
#5650 opened Jun 12, 2024 by loadams Loading…
Hybrid Offloading for ZeRO3
#5625 opened Jun 7, 2024 by tohtana Draft
fix: quantization with DeepSpeed HE
#5624 opened Jun 6, 2024 by Atry Loading…
Add support for Phi-3 small to FastGen
#5614 opened Jun 4, 2024 by adk9 Draft
Upgrade HPU image to v1.16.2.
#5610 opened Jun 4, 2024 by vshekhawat-hlab Loading…
Update profiler.py
#5584 opened May 29, 2024 by gameofdimension Loading…
reduce cpu host overhead when using moe
#5578 opened May 29, 2024 by ranzhejiang Loading…
Reuse KV cache of prefixes
#5572 opened May 27, 2024 by tohtana Draft
Add support for Microsoft Phi-3 model to DeepSpeed-FastGen
#5559 opened May 21, 2024 by adk9 Loading…
ProTip! Updated in the last three days: updated:>2024-07-13.