Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add multimodal input method in the documentation documentation Improvements or additions to documentation
#31601 opened Jan 1, 2026 by labAxiaoming Loading…
5 tasks
[Qwen3-Omni] Prefer CUDA for faster Whisper audio feature extraction nvidia qwen Related to Qwen models
#31598 opened Jan 1, 2026 by Jzz1943 Loading…
Fix audio mono dimension documentation Improvements or additions to documentation multi-modality Related to multi-modality (#4194) qwen Related to Qwen models
#31595 opened Jan 1, 2026 by jeremyteboul Loading…
Fix flashinfer experts quant config hack llama Related to Llama models nvidia
#31593 opened Jan 1, 2026 by robertgshaw2-redhat Draft
5 tasks
[Misc] Tidy up some spec decode logic in GPUModelRunner ready ONLY add when PR is ready to merge/full CI is needed v1
#31591 opened Dec 31, 2025 by njhill Loading…
[Bugfix] Replace BaseException with specific exceptions in FLA utils ready ONLY add when PR is ready to merge/full CI is needed
#31590 opened Dec 31, 2025 by c0de128 Loading…
2 of 3 tasks
[Bugfix] Narrow broad exceptions in rank detection functions
#31589 opened Dec 31, 2025 by c0de128 Loading…
2 of 3 tasks
[Bugfix][Hardware][ROCm] Narrow broad exception in PyNCCL library loading rocm Related to AMD ROCm
#31587 opened Dec 31, 2025 by c0de128 Loading…
2 of 3 tasks
[Bugfix] Narrow broad exception in custom all-reduce detection
#31586 opened Dec 31, 2025 by c0de128 Loading…
2 of 3 tasks
[Bug] Revert torch warning fix bug Something isn't working ready ONLY add when PR is ready to merge/full CI is needed v1
#31585 opened Dec 31, 2025 by yewentao256 Loading…
[Model] Support IQuestCoder model new-model Requests to new models
#31575 opened Dec 31, 2025 by yxing-bj Loading…
5 tasks
[Bugfix] Fix activation quantization for compressed-tensors W4A16 ready ONLY add when PR is ready to merge/full CI is needed
#31572 opened Dec 31, 2025 by Tmn07 Loading…
feat: support LoRA for DeepSeek-OCR(Language Model part) deepseek Related to DeepSeek models documentation Improvements or additions to documentation ready ONLY add when PR is ready to merge/full CI is needed
#31569 opened Dec 31, 2025 by zhima771 Loading…
[Model] Support SentenceTransformers V6 reranker config documentation Improvements or additions to documentation frontend
#31563 opened Dec 31, 2025 by noooop Draft
5 tasks
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.