-
Notifications
You must be signed in to change notification settings - Fork 4k
Pull requests: microsoft/DeepSpeed
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Update the list of supported models in the Chinese README of fastgen
#5773
opened Jul 16, 2024 by
beep-bebop
•
Queued
Add DataStates-LLM: Asynchronous Checkpointing Engine Support
#5763
opened Jul 10, 2024 by
mauryaavinash95
•
Draft
move is_checkpointable call reducing torch.compile Graph breaks
#5759
opened Jul 9, 2024 by
NirSonnenschein
Loading…
Update xpu-max1100.yml with new config and add some tests
#5668
opened Jun 17, 2024 by
Liangliang-Ma
Loading…
reduce all-to-all communication volume when both expert and non-expert are tensor-parallel
#5626
opened Jun 7, 2024 by
taozhiwei
Loading…
FastGen H100 MoE support: Add PyTorch multi-gemm MOE implementation
#5586
opened May 29, 2024 by
HeyangQin
Loading…
Add support for Microsoft Phi-3 model to DeepSpeed-FastGen
#5559
opened May 21, 2024 by
adk9
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2024-07-13.