Pull requests: vllm-project/vllm

Pull requests list

Skip Mamba splitting during async KV load
Labels: bug, v1
#41635 opened May 4, 2026 by arpera Contributor
2 of 4 tasks
[CI][ROCm] Install RIXL wheel in final stage of Dockerfile.rocm
Labels: ci/build, rocm
#41634 opened May 4, 2026 by simondanielsson Contributor Draft
4 tasks
[EPLB] Niixl communicator optimization. Zero-copy transfers
#41633 opened May 4, 2026 by ilmarkov Contributor Draft
4 tasks
[Bugfix][GLM-4.7] Skip schema injection for Responses named-function
Labels: bug, tool-calling
#41631 opened May 4, 2026 by hnt2601 Contributor
4 of 5 tasks
Support GDN conv-state splits for NIXL kv-connector v1
#41628 opened May 4, 2026 by arpera Contributor
2 of 4 tasks
[Bug] Skip KVConnector lookup when request opts out of prefix cache
Labels: bug, v1
#41625 opened May 4, 2026 by joerowell Contributor
1 of 5 tasks
Fix DeepSeek V4 reasoning before tool calls
Labels: deepseek
#41624 opened May 4, 2026 by mertunsall Contributor Draft
[Bugfix][Mamba] IMA in causal_conv1d kernel for long sequences
Labels: bug
#41617 opened May 4, 2026 by Flink-ddd Contributor
feat(kernels): Migrate mm_encoder_attn to vLLM IR
Labels: intel-gpu, nvidia, rocm
#41613 opened May 4, 2026 by harshaljanjani
4 of 5 tasks
Bump protobuf from 6.33.6 to 7.34.1
Labels: ci/build, dependencies, nvidia
#41610 opened May 4, 2026 by dependabot Bot
Update quack-kernels requirement from >=0.3.3 to >=0.4.1
Labels: ci/build, dependencies, nvidia
#41609 opened May 4, 2026 by dependabot Bot
Bump pyrate-limiter from 3.7.0 to 4.1.0
Labels: ci/build, dependencies, nvidia
#41608 opened May 4, 2026 by dependabot Bot
Bump fsspec from 2024.12.0 to 2026.4.0
Labels: ci/build, dependencies, nvidia
#41607 opened May 4, 2026 by dependabot Bot
Bump the minor-update group with 140 updates
Labels: ci/build, dependencies, nvidia, rocm
#41606 opened May 4, 2026 by dependabot Bot
[Bugfix] Fix /wake_up crash on hybrid models (Mamba/DeltaNet)
Labels: bug, v1
#41602 opened May 4, 2026 by kevglynn
1 of 3 tasks
DeepSeekv4 ROCm Optimization ( based on PR#40871 )
Labels: ci/build, deepseek, rocm, v1
#41601 opened May 4, 2026 by bobofang11235
4 tasks
[Model] Support TranslateGemma-12b-it
Labels: frontend
#41599 opened May 4, 2026 by zhangj1an Contributor
3 of 4 tasks
[ROCm] Fix TurboQuant shape mismatch on non-power-of-2 head_dim
Labels: rocm, v1
#41597 opened May 4, 2026 by naarob
[ROCm] Fix circular import in GCN arch detection
Labels: rocm
#41596 opened May 4, 2026 by naarob