ui: fix stop/continue during an agentic loop by ServeurpersoCom · Pull Request #23356 · ggml-org/llama.cpp

ServeurpersoCom · 2026-05-19T18:01:25Z

Overview

fix: continue button preserves tool_calls and routes through the agentic loop

The Continue button was rebuilding the final assistant message by hand as role plus content plus reasoning_content, dropping tool_calls and attachments from the persisted DatabaseMessage, which broke continue_final_message for any turn carrying tool_calls.
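A minimal sketch of the failure mode described above, assuming a trimmed-down message shape. `StoredMessage`, `rebuildByHand`, and `toApiMessage` are hypothetical names for illustration, not the identifiers used in the PR, and attachments are left out for brevity.

```typescript
// Hypothetical, simplified shapes; the real DatabaseMessage carries more fields.
interface ToolCall {
  id: string;
  type: 'function';
  function: { name: string; arguments: string };
}

interface StoredMessage {
  role: 'user' | 'assistant' | 'tool';
  content: string;
  reasoning_content?: string;
  tool_calls?: ToolCall[];
}

// Old behaviour (sketch): only three fields are copied, so tool_calls is lost.
function rebuildByHand(m: StoredMessage) {
  return { role: m.role, content: m.content, reasoning_content: m.reasoning_content };
}

// Fixed behaviour (sketch): tool_calls travels with the message when present.
function toApiMessage(m: StoredMessage): StoredMessage {
  const out: StoredMessage = { role: m.role, content: m.content };
  if (m.reasoning_content) out.reasoning_content = m.reasoning_content;
  if (m.tool_calls && m.tool_calls.length > 0) out.tool_calls = m.tool_calls;
  return out;
}

const stored: StoredMessage = {
  role: 'assistant',
  content: '',
  reasoning_content: 'I should look this up.',
  tool_calls: [{ id: 'call_1', type: 'function', function: { name: 'search', arguments: '{}' } }],
};

const lossy = rebuildByHand(stored);
const faithful = toApiMessage(stored);
```

Here `lossy` has no `tool_calls` key at all, while `faithful` keeps the single call, which is what a final-message continuation needs for a turn that carries tool calls.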

Additional information

A pure classifier in lib/utils/agentic.ts now walks the history around the target and returns append_text, rerun_turn, or next_turn, and continueAssistantMessage dispatches accordingly: classical resume for plain text, branch via regenerateMessageWithBranching when tool_calls have no results yet, or anchor a fresh agentic turn at the last tool result.
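The three outcomes can be sketched roughly as follows. This is an illustration under stated assumptions, not the code in lib/utils/agentic.ts: `Msg`, `classifyContinue`, and `anchorIndex` are hypothetical names, the `kind` values are plain string literals for brevity, and treating a partially answered set of tool calls as `rerun_turn` is a guess about the intended behaviour.

```typescript
// Hypothetical, simplified message shape used only for this sketch.
interface Msg {
  role: 'user' | 'assistant' | 'tool';
  content: string;
  tool_calls?: { id: string }[];
  tool_call_id?: string; // set on tool-result messages
}

type ContinueIntent =
  | { kind: 'append_text' }
  | { kind: 'rerun_turn' }
  | { kind: 'next_turn'; anchorIndex: number };

// Pure function: inspects the history around the target, mutates nothing.
function classifyContinue(history: Msg[], targetIndex: number): ContinueIntent {
  const target = history[targetIndex];
  if (!target || target.role !== 'assistant') {
    throw new Error('target must be an assistant message');
  }

  // Plain text turn: resume the message in place.
  const calls = target.tool_calls ?? [];
  if (calls.length === 0) return { kind: 'append_text' };

  // Walk the tool results that directly follow the target.
  const pending = new Set(calls.map((c) => c.id));
  let lastResult = -1;
  for (let i = targetIndex + 1; history[i]?.role === 'tool'; i++) {
    const id = history[i]?.tool_call_id;
    if (id !== undefined) pending.delete(id);
    lastResult = i;
  }

  // Some call has no result yet: the turn has to be regenerated (assumption:
  // a partially answered turn is treated the same as an unanswered one).
  if (pending.size > 0) return { kind: 'rerun_turn' };

  // Every call has a result: start a fresh turn after the last tool result.
  return { kind: 'next_turn', anchorIndex: lastResult };
}

const plain: Msg[] = [
  { role: 'user', content: 'hi' },
  { role: 'assistant', content: 'Hello, I was about to' },
];
const unanswered: Msg[] = [
  { role: 'user', content: 'search' },
  { role: 'assistant', content: '', tool_calls: [{ id: 'a' }] },
];
const answered: Msg[] = [...unanswered, { role: 'tool', content: 'result', tool_call_id: 'a' }];
```

With these fixtures, `classifyContinue(plain, 1)` yields `append_text`, `classifyContinue(unanswered, 1)` yields `rerun_turn`, and `classifyContinue(answered, 1)` yields `next_turn` anchored at index 2. Keeping the classifier free of side effects is what makes it straightforward to cover with unit tests.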

Ten unit tests cover the classifier. The full vitest suite stays at 224/224, and the type check and production build are clean.

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure: YES Opus 4.7 + containers with GPU

aldehir · 2026-05-19T18:49:44Z

Just want to mention that it is not feasible to resume a partial tool call with the chat completions API. There are too many models/templates to transform partial JSON to their respective tool calling notation. The maintenance cost is not worth it IMO.

If this is desired, then the continue logic needs to be re-imagined to one of the following:

A stateful backend that preserves the raw generation. This could be something built within your background task implementation.
Emit raw generation deltas alongside the normal deltas in the SSE stream. This means more traffic and is worse for unstable connections.

The logic here looks fine from a prompt-construction point of view. Worst case, the tool call would need to be regenerated from scratch but could still be seeded with the reasoning/content.

ServeurpersoCom · 2026-05-20T04:49:49Z

We can improve it a little by seeding the regenerated turn with the reasoning/content already produced before the cut, so a continue lands in the CoT of the last tool call instead of regenerating it from scratch, but we can't do any better than that.

ServeurpersoCom · 2026-05-20T05:02:28Z

It adds quite a bit of complexity, because seeding the CoT is only useful if the regenerated turn re-enters the agentic loop so that the tool call actually fires. Otherwise the user just sees the tool call reappear without running, so regenerating from scratch stays much simpler.

Address allozaur review: replace the kind string literals in the ContinueIntent union with a ContinueIntentKind enum (append_text, rerun_turn, next_turn), next to the other agentic enums. Producers, the chat store consumers and the unit test all reference the enum, no magic strings left.

ui: fix stop/continue during an agentic loop

bf34f6f

ServeurpersoCom requested a review from a team as a code owner May 19, 2026 18:01

This was referenced May 19, 2026

Feature Request: WebUI response streaming is fragile #21754

Closed

Feature Request: Background streaming for llama-ui #23136

Open

github-actions Bot added examples server/ui labels May 20, 2026

allozaur self-assigned this May 20, 2026

allozaur requested changes May 21, 2026


Comment thread tools/ui/src/lib/stores/chat.svelte.ts Outdated

Comment thread tools/ui/src/lib/utils/agentic.ts Outdated

ServeurpersoCom wants to merge 2 commits into ggml-org:master from ServeurpersoCom:ui/fix-continue-mcp

3 participants
