fix: chunk prefill by jiayyu · Pull Request #1032 · ROCm/ATOM

jiayyu · 2026-06-02T07:14:27Z

Motivation

Technical Details

Test Plan

Test Result

Submission Checklist

Look over the contributing guidelines at https://github.com/ROCm/ROCm/blob/develop/CONTRIBUTING.md#pull-requests.

Preempted seqs keep their decoded token_ids (preempt() only deallocates KV blocks) so seq.num_tokens > seq.num_prompt_tokens on re-admit. Computing num_new_tokens from num_prompt_tokens caused chunk=0 when a fully-cached prefix exhausted num_prompt_tokens, triggering the "chunk must be positive" assert under high concurrency benchmarks.

Copilot

Pull request overview

Fixes chunked-prefill scheduling so preempted sequences (whose KV blocks have been freed but whose decoded token_ids are retained) re-forward all of their tokens — not just the original prompt tokens — when they are re-admitted from the waiting queue. Also removes the DeepSeek-V4 carve-out that auto-disabled chunked prefill, indicating chunked prefill is now considered safe for V4.

Changes:

In scheduler.py, base num_new_tokens on seq.num_tokens instead of seq.num_prompt_tokens so previously-decoded tokens of a preempted seq are recomputed.
Remove the enable_chunked_prefill = False auto-override and accompanying warning for DeepSeek-V4 in config.py.
Drop the corresponding "DeepSeek-V4 auto-disables this" note from the --enable-chunked-prefill CLI help in arg_utils.py.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated no comments.

File	Description
atom/model_engine/scheduler.py	Use `num_tokens` (prompt + decoded) when sizing the new prefill chunk for waiting seqs, so preempted/requeued seqs recompute KV for all retained tokens.
atom/config.py	Remove DeepSeek-V4 auto-disable of chunked prefill and its warning.
atom/model_engine/arg_utils.py	Remove now-stale DeepSeek-V4 note from the chunked-prefill CLI help.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

jiayyu added 2 commits June 2, 2026 05:40

remove disable deepseek v4 chunk prefill flag

8be2f18

Copilot AI review requested due to automatic review settings June 2, 2026 07:14

Copilot started reviewing on behalf of jiayyu June 2, 2026 07:14 View session

Copilot AI reviewed Jun 2, 2026

View reviewed changes

fix format

80845d7

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: chunk prefill#1032

fix: chunk prefill#1032
jiayyu wants to merge 3 commits into
mainfrom
fpz/dsv4

jiayyu commented Jun 2, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

jiayyu commented Jun 2, 2026

Motivation

Technical Details

Test Plan

Test Result

Submission Checklist

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants