Pull requests: vllm-project/vllm

Pull requests list

[Hardware][TPU] Support MoE with Pallas GMM kernel tpu Related to Google TPUs
#6457 opened Jul 16, 2024 by WoosukKwon
[Misc] Log spec decode metrics ready
#6454 opened Jul 15, 2024 by comaniac
[Model] H2O Danube3 Collection
#6451 opened Jul 15, 2024 by g-eoj Draft
[Not for review] PP ADAG
#6448 opened Jul 15, 2024 by ruisearch42 Draft
[Doc] Add documentations for nightly benchmarks
#6412 opened Jul 13, 2024 by KuntaiDu
[Model] Pipeline parallel support for Mixtral
#6403 opened Jul 13, 2024 by binxuan
torch.compile based model optimizer
#6377 opened Jul 12, 2024 by bnellnm Draft
Fix the lm_head in gptbigcode in lora mode
#6357 opened Jul 12, 2024 by maxdebayser
[Bugfix] Fix Ray Metrics API usage
#6354 opened Jul 11, 2024 by Yard1
[Kernel] Fix identical branches
#6344 opened Jul 11, 2024 by stevegrubb