forked from vllm-project/vllm
-
Notifications
You must be signed in to change notification settings - Fork 75
Pull requests: HabanaAI/vllm-fork
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Extend accuracy tests for models that we support
#824
opened Feb 13, 2025 by
AnetaKaczynska
Loading…
[DNM]Deepseek r1 g2opt - this is only for checking codes diff
#822
opened Feb 12, 2025 by
xuechendi
Loading…
Update documentation to reflect current bucket defaults
#817
opened Feb 12, 2025 by
nngokhale
Loading…
Bump transformers from 4.47.0 to 4.48.0
dependencies
Pull requests that update a dependency file
#815
opened Feb 11, 2025 by
dependabot
bot
Loading…
Fix sporadic issue in async_engine/test_api_server tests
#794
opened Feb 7, 2025 by
akarnows
Loading…
Support qwenvl model for HPU
New Model
Issue o PR to enable a new model
#793
opened Feb 7, 2025 by
yingjie-han
Loading…
[DEEPSEEK_V3/R1] includes features of fp8 dequant, MLA, Expert parallelism
#792
opened Feb 6, 2025 by
xuechendi
Loading…
[DO NOT MERGE][PoC] Mark dynamic shapes in torch.compile mode
#755
opened Jan 29, 2025 by
kzawora-intel
•
Draft
Previous Next
ProTip!
Updated in the last three days: updated:>2025-02-10.