[ATOM-vLLM] Upgrade vLLM version to v0.22.0#1006

Open: whx-sjtu wants to merge 9 commits into main from hexwang/upgrade_vllm_022

Contributor

@whx-sjtu whx-sjtu commented Jun 1, 2026

Motivation

This PR builds on PR #750 and upgrades vLLM to v0.22.0. The upgrade is a prerequisite for enabling DeepSeek V4.

Technical Details

  1. Adapt to the new is_ssm interface.
  2. Adapt to the new MLA Prefill Backend.
  3. Fix breakage caused by the mamba_type interface change.
  4. Fix the handling of invalid values in topk_indices.
  5. Add new dependencies to the Dockerfile.
  6. Remove the redundant self._use_cudnn_prefill.
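
The PR description does not show how item 4 is implemented. As a minimal sketch of one common pattern, assuming that unused top-k slots carry a sentinel such as -1 (an assumption, not confirmed by this PR), invalid entries can be replaced with a safe index and tracked in a mask so that a later gather never reads out of bounds. The names `sanitize_topk_indices` and `gather_valid` are hypothetical and are not part of ATOM or vLLM.

```python
from typing import List, Sequence, Tuple


def sanitize_topk_indices(
    topk_indices: Sequence[int], seq_len: int, fill: int = 0
) -> Tuple[List[int], List[bool]]:
    """Replace out-of-range indices with `fill` and report which were valid.

    Assumption: any index outside [0, seq_len) is invalid, which covers a
    -1 sentinel as well as stale indices past the end of the sequence.
    """
    safe: List[int] = []
    mask: List[bool] = []
    for idx in topk_indices:
        ok = 0 <= idx < seq_len
        safe.append(idx if ok else fill)
        mask.append(ok)
    return safe, mask


def gather_valid(values: Sequence[float], topk_indices: Sequence[int]) -> List[float]:
    """Gather only the entries selected by valid indices."""
    safe, mask = sanitize_topk_indices(topk_indices, len(values))
    return [values[i] for i, ok in zip(safe, mask) if ok]
```

Keeping the mask separate from the sanitized indices lets the caller decide whether to drop invalid slots, as `gather_valid` does, or to keep the shape fixed and zero out their contribution.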

Test Plan

Run the full test suite on all CI cases.

Test Result

| Case ID | Variant | Command Source | Flexible EM | Flexible Stderr | Strict EM | Strict Stderr |
| --- | --- | --- | --- | --- | --- | --- |
| qwen3next_tp1 | no-mtp | recipe | 0.8226 | 0.0105 | 0.7354 | 0.0122 |
| qwen3next_mtp_tp1 | mtp | recipe | 0.8173 | 0.0106 | 0.7316 | 0.0122 |
| qwen35_397b_fp8_tp8 | no-mtp | recipe | 0.8468 | 0.0099 | 0.8316 | 0.0103 |
| llama31_8b_tp1 | no-mtp | recipe | 0.7574 | 0.0118 | 0.6717 | 0.0129 |
| glm5_tp8 | no-mtp | recipe | 0.9500 | 0.0060 | 0.9500 | 0.0060 |
| glm47_tp4 | no-mtp | recipe | 0.9310 | 0.0070 | 0.9242 | 0.0073 |
| glm47_mtp_tp4 | mtp | recipe | 0.9462 | 0.0062 | 0.9454 | 0.0063 |
| deepseek_v32_tp4 | no-mtp | ci-supplement | 0.9522 | 0.0059 | 0.9515 | 0.0059 |
| deepseek_v32_mtp_tp4 | mtp | ci-supplement | 0.9507 | 0.0060 | 0.9500 | 0.0060 |
| dsr1_fp8_tp8 | no-mtp | ci-supplement | 0.9507 | 0.0060 | 0.9454 | 0.0063 |
| dsr1_mxfp4_tp8 | no-mtp | recipe | 0.9393 | 0.0066 | 0.9371 | 0.0067 |
| kimi_k25_tp4 | no-mtp | recipe | 0.9348 | 0.0068 | 0.9356 | 0.0068 |
| kimi_k2_thinking_tp8 | no-mtp | recipe | 0.9348 | 0.0068 | 0.9333 | 0.0069 |
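
To help read the Stderr columns above, here is a minimal sketch of the standard error of an exact-match rate and of a rough check on whether two scores differ by more than sampling noise. The PR does not state the evaluation set size, so `n = 1319` in the usage note is an assumption, chosen because it reproduces the reported values to four decimals. The function names are hypothetical.

```python
import math


def binomial_stderr(p: float, n: int) -> float:
    """Standard error of a proportion p estimated from n samples."""
    return math.sqrt(p * (1.0 - p) / n)


def within_noise(p_a: float, se_a: float, p_b: float, se_b: float, z: float = 1.96) -> bool:
    """Return True if |p_a - p_b| is within z combined standard errors.

    Simplification: treats the two runs as independent, which overstates
    the noise when both runs score the same test questions.
    """
    combined = math.sqrt(se_a ** 2 + se_b ** 2)
    return abs(p_a - p_b) <= z * combined
```

For example, `binomial_stderr(0.8226, 1319)` rounds to 0.0105, matching the Flexible Stderr reported for qwen3next_tp1. Under this check, `within_noise(0.8226, 0.0105, 0.8173, 0.0106)` returns `True`, so the no-mtp and mtp Flexible EM scores for qwen3next are not distinguishable at roughly the 95% level.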

@whx-sjtu whx-sjtu changed the title [vLLM Plugin] Upgrade vLLM version to v0.21.0 [vLLM Plugin] Upgrade vLLM version to v0.22.0 Jun 1, 2026
@zejunchen-zejun zejunchen-zejun changed the title [vLLM Plugin] Upgrade vLLM version to v0.22.0 [ATOM-vLLM] Upgrade vLLM version to v0.22.0 Jun 1, 2026
@zejunchen-zejun
Collaborator

The ATTN refactor PR has been merged into main, so hexiang, you can rebase onto main.

whx-sjtu added 5 commits June 2, 2026 02:46
Signed-off-by: whx-sjtu <xiaowang990929@gmail.com>
Signed-off-by: whx-sjtu <xiaowang990929@gmail.com>
Signed-off-by: whx-sjtu <xiaowang990929@gmail.com>
Signed-off-by: whx-sjtu <xiaowang990929@gmail.com>
Signed-off-by: whx-sjtu <xiaowang990929@gmail.com>
@whx-sjtu whx-sjtu force-pushed the hexwang/upgrade_vllm_022 branch from 5399a18 to ea0d20a Compare June 2, 2026 06:13
whx-sjtu added 2 commits June 2, 2026 08:09
Signed-off-by: whx-sjtu <xiaowang990929@gmail.com>
Signed-off-by: whx-sjtu <xiaowang990929@gmail.com>
@whx-sjtu whx-sjtu marked this pull request as ready for review June 2, 2026 09:08
Signed-off-by: whx-sjtu <xiaowang990929@gmail.com>
Comment thread docker/Dockerfile Outdated
Comment thread docker/Dockerfile
Signed-off-by: whx-sjtu <xiaowang990929@gmail.com>
2 participants