Pull requests: huggingface/text-generation-inference

Pull requests list

Don't error on OpenAI valid top_p values.
#2231 opened Jul 15, 2024 by ErikKaum
doc: Add metrics documentation and add a 'Reference' section [label: documentation]
#2230 opened Jul 15, 2024 by Hugoch
2 of 5 tasks
Add support for Deepseek V2
#2224 opened Jul 12, 2024 by danieldk
5 tasks
usage stats and crash reports
#2220 opened Jul 11, 2024 by ErikKaum
5 tasks
feat: Add load tests
#2217 opened Jul 11, 2024 by Hugoch
1 of 5 tasks
added tie_weights support to mlp speculator
#2215 opened Jul 10, 2024 by JRosenkranz
5 tasks
Move to new cluster
#2208 opened Jul 9, 2024 by glegendre01
5 tasks
fix: refactor adapter weight loading and mapping
#2193 opened Jul 5, 2024 by drbh
misc: update vllm dependency to support attention size 160
#2187 opened Jul 4, 2024 by PaoloAlbano
5 tasks
feat: add simple ttft load_test
#2170 opened Jul 2, 2024 by drbh
feat: add test to view batch speedup amount
#2168 opened Jul 2, 2024 by drbh
Add API_Key for Auth
#2142 opened Jun 28, 2024 by KevinDuffy94
2 of 5 tasks
Fixing AMD CI
#2109 opened Jun 24, 2024 by Narsil
5 tasks
Add support for Docker Compose
#2063 opened Jun 12, 2024 by StefanDanielSchwarz
1 of 5 tasks
Add FP8 KVCache support
#2028 opened Jun 6, 2024 by mht-sharma
1 of 4 tasks
feat: re-allocate pages dynamically
#2024 opened Jun 5, 2024 by OlivierDehaene