Pull requests: vllm-project/vllm
Skip Mamba splitting during async KV load
bug
Something isn't working
v1
#41635
opened May 4, 2026 by
arpera
Contributor
2 of 4 tasks
[CI][ROCm] Install RIXL wheel in final stage of Dockerfile.rocm
ci/build
rocm
Related to AMD ROCm
#41634
opened May 4, 2026 by
simondanielsson
Contributor
Draft
4 tasks
[Misc] Add common random prefix option to structured-output serving benchmark
performance
Performance-related issues
structured-output
#41632
opened May 4, 2026 by
viktorpusTT
4 tasks
[Bugfix][GLM-4.7] Skip schema injection for Responses named-function
bug
Something isn't working
tool-calling
#41631
opened May 4, 2026 by
hnt2601
Contributor
4 of 5 tasks
[NVFP4][fix] Fix layer.weight -> w13 typo in NVFP4 MOE emulation kernel preparation
#41630
opened May 4, 2026 by
fxmarty-amd
Contributor
Support GDN conv-state splits for NIXL
kv-connector
v1
#41628
opened May 4, 2026 by
arpera
Contributor
2 of 4 tasks
[V1][DP][LB] Publish request counts at the start of each engine step
v1
#41626
opened May 4, 2026 by
vadiklyutiy
Collaborator
[Bug] Skip KVConnector lookup when request opts out of prefix cache
bug
Something isn't working
v1
#41625
opened May 4, 2026 by
joerowell
Contributor
1 of 5 tasks
Fix DeepSeek V4 reasoning before tool calls
deepseek
Related to DeepSeek models
#41624
opened May 4, 2026 by
mertunsall
Contributor
Draft
[Core] block_pool: sort the list of allocated free blocks
v1
#41621
opened May 4, 2026 by
da-x
[Bugfix][Mamba] IMA in causal_conv1d kernel for long sequences
bug
Something isn't working
#41617
opened May 4, 2026 by
Flink-ddd
Contributor
feat(kernels): Migrate mm_encoder_attn to vLLM IR
intel-gpu
Related to Intel GPU
nvidia
rocm
Related to AMD ROCm
#41613
opened May 4, 2026 by
harshaljanjani
4 of 5 tasks
Bump protobuf from 6.33.6 to 7.34.1
ci/build
dependencies
Pull requests that update a dependency file
nvidia
#41610
opened May 4, 2026 by
dependabot
Bot
Update quack-kernels requirement from >=0.3.3 to >=0.4.1
ci/build
dependencies
Pull requests that update a dependency file
nvidia
#41609
opened May 4, 2026 by
dependabot
Bot
Bump pyrate-limiter from 3.7.0 to 4.1.0
ci/build
dependencies
Pull requests that update a dependency file
nvidia
#41608
opened May 4, 2026 by
dependabot
Bot
Bump fsspec from 2024.12.0 to 2026.4.0
ci/build
dependencies
Pull requests that update a dependency file
nvidia
#41607
opened May 4, 2026 by
dependabot
Bot
Bump the minor-update group with 140 updates
ci/build
dependencies
Pull requests that update a dependency file
nvidia
rocm
Related to AMD ROCm
#41606
opened May 4, 2026 by
dependabot
Bot
DeepSeekv4 ROCm Optimization ( based on PR#40871 )
ci/build
deepseek
Related to DeepSeek models
rocm
Related to AMD ROCm
v1
#41601
opened May 4, 2026 by
bobofang11235
4 tasks
[Model] Support TranslateGemma-12b-it
frontend
#41599
opened May 4, 2026 by
zhangj1an
Contributor
3 of 4 tasks
[ROCm] Fix circular import in GCN arch detection
rocm
Related to AMD ROCm
#41596
opened May 4, 2026 by
naarob