Skip to content

[Optimization] Deduplicate shared image/video utilities across VL processors#6988

Open
luukunn wants to merge 13 commits intoPaddlePaddle:developfrom
luukunn:merge_processor
Open

[Optimization] Deduplicate shared image/video utilities across VL processors#6988
luukunn wants to merge 13 commits intoPaddlePaddle:developfrom
luukunn:merge_processor

Conversation

@luukunn
Copy link
Collaborator

@luukunn luukunn commented Mar 24, 2026

Motivation

  • Extract shared image utility functions (smart_resize, round/ceil/floor_by_factor, is_scaled_image) into fastdeploy/input/image_processors/common.py, covering both qwen and paddleocr variants
  • Extract shared video utilities (VideoReaderWrapper, read_video_decord, sample_frames) into fastdeploy/input/video_utils.py, with separate sample_frames_qwen and sample_frames_paddleocr variants
  • Update all 4 VL image_processor.py files to import from the new shared modules instead of maintaining local copies
  • Update qwen_vl, qwen3_vl, paddleocr_vl processor files to use sample_frames from video_utils.py; remove now-empty qwen_vl_processor/process_video.py and paddleocr_vl_processor/process_video.py

💡 If this PR is a Cherry Pick, the PR title needs to follow the format by adding the [Cherry-Pick] label at the very beginning and appending the original PR ID at the end. For example, [Cherry-Pick][CI] Add check trigger and logic(#5191)

💡 如若此PR是Cherry Pick,PR标题需遵循格式,在最开始加上[Cherry-Pick]标签,以及最后面加上原PR ID,例如[Cherry-Pick][CI] Add check trigger and logic(#5191)

Modifications

Usage or Command

Accuracy Tests

Checklist

  • Add at least a tag in the PR title.
    • Tag list: [[FDConfig],[APIServer],[Engine], [Scheduler], [PD Disaggregation], [Executor], [Graph Optimization], [Speculative Decoding], [RL], [Models], [Quantization], [Loader], [OP], [KVCache], [DataProcessor], [BugFix], [Docs], [CI], [Optimization], [Feature], [Benchmark], [Others], [XPU], [HPU], [GCU], [DCU], [Iluvatar], [Metax]]
    • You can add new tags based on the PR content, but the semantics must be clear.
  • Format your code, run pre-commit before commit.
  • Add unit tests. Please write the reason in this PR if no unit tests.
  • Provide accuracy results.
  • If the current PR is submitting to the release branch, make sure the PR has been submitted to the develop branch, then cherry-pick it to the release branch with the [Cherry-Pick] PR tag.

Copilot AI review requested due to automatic review settings March 24, 2026 07:39
@paddle-bot
Copy link

paddle-bot bot commented Mar 24, 2026

Thanks for your contribution!

This comment was marked as resolved.

@codecov-commenter
Copy link

codecov-commenter commented Mar 24, 2026

Codecov Report

❌ Patch coverage is 82.52427% with 36 lines in your changes missing coverage. Please review.
⚠️ Please upload report for BASE (develop@522d12c). Learn more about missing BASE report.

Files with missing lines Patch % Lines
fastdeploy/input/video_utils.py 79.38% 20 Missing and 7 partials ⚠️
fastdeploy/input/image_processors/common.py 85.71% 5 Missing and 4 partials ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             develop    #6988   +/-   ##
==========================================
  Coverage           ?   73.87%           
==========================================
  Files              ?      399           
  Lines              ?    56041           
  Branches           ?     8842           
==========================================
  Hits               ?    41398           
  Misses             ?    11724           
  Partials           ?     2919           
Flag Coverage Δ
GPU 73.86% <82.52%> (?)

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Copilot AI review requested due to automatic review settings March 24, 2026 12:12

This comment was marked as resolved.

This comment was marked as resolved.

Copilot AI review requested due to automatic review settings March 24, 2026 17:24
Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Copilot reviewed 16 out of 16 changed files in this pull request and generated 3 comments.

Comment on lines +15 to +16
"""Shared video utilities: VideoReaderWrapper, read_video_decord, and sample_frames."""

Copy link

Copilot AI Mar 24, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

PR 描述里虽然写了 Motivation,但 Modifications / Usage or Command / Accuracy Tests 等小节仍为空(或缺少具体可复现的验证方式)。建议补充:本次抽取公共模块后如何验证行为一致(例如运行哪些单测/示例命令)。

Copilot uses AI. Check for mistakes.
Comment on lines +76 to +79
video_path.seek(0)
tmp_bytes = video_path.read()
video_path.seek(0)
if _is_gif(tmp_bytes):
Copy link

Copilot AI Mar 24, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

VideoReaderWrapper.__init__ 里对 io.BytesIO 做 GIF 判断时使用了 video_path.read() 读取整个流到内存,仅为了检查 magic header 会在大视频输入时造成不必要的内存/耗时开销。建议只读取前 6 个字节进行判断(并恢复流位置),避免整段读取。

Suggested change
video_path.seek(0)
tmp_bytes = video_path.read()
video_path.seek(0)
if _is_gif(tmp_bytes):
# Only read the first 6 bytes to check GIF header and restore position
current_pos = video_path.tell()
header = video_path.read(6)
video_path.seek(current_pos)
if _is_gif(header):
video_path.seek(0)
tmp_bytes = video_path.read()

Copilot uses AI. Check for mistakes.
Comment on lines +128 to +129
video_meta contains keys: "fps", "duration", "num_of_frame".
"""
Copy link

Copilot AI Mar 24, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

read_video_decord 的参数 save_to_disk 在函数体内未被使用,会让调用方误以为该开关生效。若是为了兼容旧签名,建议在 docstring 里明确说明目前忽略该参数;否则建议移除该参数或用 **kwargs 吸收以减少误用。

Suggested change
video_meta contains keys: "fps", "duration", "num_of_frame".
"""
video_meta contains keys: "fps", "duration", "num_of_frame".
Note:
The `save_to_disk` argument is currently ignored and kept only for
backward compatibility with older function signatures. Passing
save_to_disk=True will not change the behavior.
"""
if save_to_disk:
data_processor_logger.warning(
"Argument `save_to_disk` in `read_video_decord` is deprecated and "
"currently ignored. The video is not saved to disk."
)

Copilot uses AI. Check for mistakes.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants