ggml-webgpu: Support GPU profiling beyond the maximum query count by yomaytk · Pull Request #22995 · ggml-org/llama.cpp

yomaytk · 2026-05-13T00:42:36Z

Overview

This PR fixes the bug described in the Additional Information section.

Flush timestamp slots and reset the timestamp state when the number of used timestamp slots is nearly full.

I confirmed that GPU profiles can now be collected for Qwen3.5-35B-A3B-GGUF and several other models (Qwen3.5, Qwen3.6, Gemma 4, and Llama 3).

Additional Information

I noticed that unsloth/Qwen3.5-35B-A3B-GGUF overflowed the timestamp QuerySet when I tried to collect a GPU profile:

llama.cpp/ggml/src/ggml-webgpu/ggml-webgpu.cpp:571: GGML_ASSERT(ctx->profile_timestamp_query_count + 2 <= WEBGPU_MAX_PROFILE_QUERY_COUNT) failed

This suggests that we need logic to allow profile collection even when a model requires more than 4096 timestamp queries.

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure: YES - I used AI to investigate WebGPU specification

reeselevine · 2026-05-13T16:30:06Z

thanks, this is a nice clean addition!

flush the gpu profile timestamp before the queryset is overflowed

5576c7d

yomaytk requested a review from a team as a code owner May 13, 2026 00:42

github-actions Bot added ggml changes relating to the ggml tensor library for machine learning WebGPU labels May 13, 2026

reeselevine approved these changes May 13, 2026

View reviewed changes

reeselevine requested a review from CISC May 13, 2026 16:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ggml-webgpu: Support GPU profiling beyond the maximum query count#22995

ggml-webgpu: Support GPU profiling beyond the maximum query count#22995
yomaytk wants to merge 1 commit into
ggml-org:masterfrom
yomaytk:new-flush-gpu-profile

yomaytk commented May 13, 2026 •

edited

Loading

Uh oh!

reeselevine commented May 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

yomaytk commented May 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Overview

Additional Information

Requirements

Uh oh!

reeselevine commented May 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

yomaytk commented May 13, 2026 •

edited

Loading