Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[fix] fix SM check for Flashinfer TRTLLM MOE nvidia
#30314 opened Dec 9, 2025 by jiahanc Loading…
5 tasks
[Misc][Quantization] Clarify the intent of GGUF FusedMoE weight materialization
#30310 opened Dec 9, 2025 by a4lg Loading…
1 of 5 tasks
[DCP][Bugfix][CI] Fix accuracy issue of DCP when using FLASH_ATTN_MLA ready ONLY add when PR is ready to merge/full CI is needed v1
#30309 opened Dec 9, 2025 by FENP Loading…
3 of 5 tasks
[bugfix][quantization] fix quark qwen3 kv_cache quantization qwen Related to Qwen models ready ONLY add when PR is ready to merge/full CI is needed
#30308 opened Dec 9, 2025 by haoyangli-amd Loading…
[Model][Quantization] Fix / Add GGUF support for Qwen2 MoE models qwen Related to Qwen models
#30307 opened Dec 9, 2025 by a4lg Loading…
3 of 5 tasks
[Bugfix] Qwen 3 VL Embedding loading qwen Related to Qwen models ready ONLY add when PR is ready to merge/full CI is needed
#30303 opened Dec 9, 2025 by noooop Loading…
5 tasks
[Misc] Pass reasoning to deepseekV32 tokenizer deepseek Related to DeepSeek models frontend
#30302 opened Dec 9, 2025 by kingsmad Draft
5 tasks
[ResponsesAPI] Add GPTOSS MCP tool streaming frontend gpt-oss Related to GPT-OSS models
#30301 opened Dec 9, 2025 by qandrew Loading…
Main 20251205 amd ci/build ready ONLY add when PR is ready to merge/full CI is needed rocm Related to AMD ROCm
#30298 opened Dec 9, 2025 by Alexei-V-Ivanov-AMD Loading…
[Core] Add SLA-tiered scheduling (opt-in) and docs documentation Improvements or additions to documentation v1
#30297 opened Dec 9, 2025 by ProdByBuddha Loading…
3 of 5 tasks
Adding quantized fused_moe_lora support
#30286 opened Dec 9, 2025 by yugong333 Loading…
5 tasks
Ensure minimum frames for GLM 4.6V compatibility
#30285 opened Dec 9, 2025 by gh-wf Loading…
1 of 3 tasks
[CI/Build] Ignore data_parallel_size_local
#30281 opened Dec 8, 2025 by rjrock Loading…
3 of 5 tasks
[CPU][Bugfix] Fix CPU Profiler issue v1
#30278 opened Dec 8, 2025 by zhili03 Loading…
[BugFix] Fix non detected failing tests ci/build ready ONLY add when PR is ready to merge/full CI is needed
#30277 opened Dec 8, 2025 by ilmarkov Loading…
5 tasks
[ROCM][CI] Fix AMD Examples Test Group ci/build documentation Improvements or additions to documentation rocm Related to AMD ROCm
#30276 opened Dec 8, 2025 by Concurrensee Loading…
ProTip! What’s not been updated in a month: updated:<2025-11-08.