Evals TUI tree traversal by miguelg719 · Pull Request #2100 · browserbase/stagehand

miguelg719 · 2026-05-08T23:55:00Z

why

what changed

test plan

Summary by cubic

Introduces a unified command tree for the Evals TUI/CLI with context-aware traversal, consistent parsing, and clearer help routing. Supports Linear STG-1880 by improving onboarding and command discovery with > navigation, an evals sigil for root jumps, and contextual help.

New Features
- Single buildCommandTree powers both REPL and argv dispatch.
- > works as a path separator in REPL and argv; quoted > is preserved in REPL, while argv re-splits any arg containing >.
- Context navigation: .. goes up one level; bare evals jumps to root or strips a leading sigil; invoking a node with children auto-enters that context in REPL; prompt shows the current path; banner updated.
- Help routing: --help after a leaf calls that node’s help; help shows help for the current context.
- Root fallback: unknown tokens at root call run (e.g., evals act); unknown at depth errors for clarity.
- REPL quality of life: Esc pops context when idle; Esc still aborts in-flight runs (cooperative/double-press aggressive).
Refactors
- cli.ts and repl.ts now resolve and execute via dispatch from the shared command tree; added tokenizeArgv for argv > splitting and renderPrompt for context-aware prompts.
- Moved line tokenization to tui/tokenize.ts; added tui/commandTree.ts with resolution, prompt rendering, and argv tokenization.
- Kept command handlers lazy-loaded to avoid heavy imports on quiet commands; added focused tests for the tree, tokenization (incl. > chaining), argv sigil handling, and CLI edge cases.

^{Written for commit fef0768. Summary will update on new commits.}

changeset-bot · 2026-05-08T23:55:05Z

⚠️ No Changeset found

Latest commit: fef0768

Merging this PR will not cause a version bump for any packages. If these changes should not result in a new version, you're good to go. If these changes should result in a version bump, you need to add a changeset.

This PR includes no changesets

When changesets are added to this PR, you'll see the packages that this PR includes changesets for and the associated semver types

Click here to learn what changesets are, and how to add one.

Click here if you're a maintainer who wants to add a changeset to this PR

cubic-dev-ai

2 issues found across 7 files

Confidence score: 3/5

There is a concrete regression risk in packages/evals/tui/commandTree.ts: for inputs like evals act, the fallback forwards tokens including the "evals" sigil to the run handler, which can misparse arguments and break expected command handling.
A second medium-severity concern is in packages/evals/tui/commandTree.ts, where user-facing failures are thrown as generic Error values and echoed with raw token interpolation, increasing the chance of confusing or unsanitized REPL error output.
Given one high-confidence parsing bug (7/10) plus one additional medium issue (6/10), this is not merge-blocking by itself but carries meaningful user-impact risk if shipped without a fix.
Pay close attention to packages/evals/tui/commandTree.ts - command fallback token forwarding and REPL error sanitization need verification.

Prompt for AI agents (unresolved issues)


Check if these issues are valid — if so, understand the root cause of each and fix them. If appropriate, use sub-agents to investigate and fix each issue separately.


<file name="packages/evals/tui/commandTree.ts">

<violation number="1" location="packages/evals/tui/commandTree.ts:242">
P2: Custom agent: **Exception and error message sanitization**

User-facing command errors are thrown as generic `Error` and echoed to the REPL with raw token interpolation, rather than using sanitized typed errors.</violation>

<violation number="2" location="packages/evals/tui/commandTree.ts:275">
P1: When the `evals` sigil is present and the token after it is unknown (e.g., `evals act`), the root fallback passes the full `tokens` array — including the leading `"evals"` — to the run handler. This causes `parseRunArgs` to misinterpret `"evals"` as a target/flag. Strip the sigil before delegating.</violation>
</file>

Architecture diagram

sequenceDiagram
    participant Shell as Shell/argv
    participant CLI as cli.ts
    participant REPL as repl.ts
    participant CmdTree as commandTree.ts
    participant Tok as tokenize.ts
    participant Handler as Command Handlers
    participant Disc as discovery.ts
    participant Run as run command

    Note over Shell,Run: NEW: Unified command tree traversal

    alt argv mode (shell)
        Shell->>CLI: node evals [args...]
        CLI->>CmdTree: buildCommandTree()
        CmdTree-->>CLI: root CommandNode
        CLI->>CLI: tokenizeArgv(args) — split on >
        CLI->>CmdTree: dispatch(root, tokens, ctx)
    else REPL mode (interactive)
        Shell->>REPL: node evals (no args)
        REPL->>REPL: printBanner()
        REPL->>CmdTree: buildCommandTree()
        CmdTree-->>REPL: root CommandNode
        loop Each input line
            REPL->>Tok: tokenize(line) — split on whitespace or >
            Tok-->>REPL: tokens[]
            REPL->>CmdTree: dispatch(root, tokens, ctx)
        end
    end

    Note over CmdTree: Resolution logic (resolveCommand)

    CmdTree->>CmdTree: Check first token for meta commands (..,help,exit,clear)
    alt Meta command
        CmdTree->>REPL: Pop context / print help / exit
    else Leading "evals" sigil
        CmdTree->>CmdTree: Strip sigil, resolve from root
    else Relative path (REPL context)
        CmdTree->>CmdTree: walkPath(contextPath) → current node
        CmdTree->>CmdTree: matchPath(current, tokens) → deepest match
        alt Full match
            CmdTree->>CmdTree: Check for --help/help/-h on first arg
            opt Help requested
                CmdTree->>Handler: node.printHelp(absolutePath)
            end
            CmdTree->>Handler: node.handler(args, ctx)
            alt REPL mode + node has children + no args
                CmdTree->>REPL: setContextPath(absolutePath)
                REPL->>REPL: renderPrompt(contextPath)
            end
        else Partial match at root (unknown token)
            CmdTree->>Run: fallback: run.handler(tokens, ctx)
            Run->>Disc: discoverTasks(getRuntimeTasksRoot())
            Disc-->>Run: registry
            Run->>Run: execute run with registry
        else Partial match at depth (unknown)
            CmdTree-->>REPL: throw Error (unknown subcommand)
        end
    end

    Note over REPL: Context-aware prompt
    REPL->>REPL: renderPrompt(contextPath) → "evals > config > core > "

    Note over REPL: Esc key handling (CHANGED)
    REPL->>REPL: On Esc press
    alt abortRef.current is null (idle)
        alt contextPath.length > 0
            REPL->>REPL: pop context level
            REPL->>REPL: setPrompt(renderPrompt(contextPath))
        end
    else abortRef.current exists (run in flight)
        REPL->>REPL: Cooperative/aggressive abort logic
        REPL->>REPL: abortRef.current.abort(reason)
    end

    Note over CLI: Root fallback for unknown tokens
    CmdTree->>Ctx: unknown at root → call run handler with all tokens
    Ctx->>CmdTree: runNode.handler(tokens, ctx)

    Note over CmdTree: Registry lazy loading
    CmdTree->>REPL: getRegistry()
    REPL->>Disc: discoverTasks(resolvedTasksRoot)
    Disc-->>REPL: TaskRegistry
    REPL->>REPL: cache in ctx.setRegistry()

    Note over Shell,Run: Banner changed to mention .. context navigation

_{Reply with feedback, questions, or to request a fix. Tag @cubic-dev-ai to re-run a review, or fix all with cubic.}

cubic-dev-ai · 2026-05-11T05:23:46Z

+      if (result.context.length === 0) {
+        const runNode = findChild(root, "run");
+        if (runNode?.handler) {
+          await runNode.handler(tokens, ctx);


P1: When the evals sigil is present and the token after it is unknown (e.g., evals act), the root fallback passes the full tokens array — including the leading "evals" — to the run handler. This causes parseRunArgs to misinterpret "evals" as a target/flag. Strip the sigil before delegating.

Prompt for AI agents

Check if this issue is valid — if so, understand the root cause and fix it. At packages/evals/tui/commandTree.ts, line 275: <comment>When the `evals` sigil is present and the token after it is unknown (e.g., `evals act`), the root fallback passes the full `tokens` array — including the leading `"evals"` — to the run handler. This causes `parseRunArgs` to misinterpret `"evals"` as a target/flag. Strip the sigil before delegating.</comment> <file context> @@ -0,0 +1,660 @@ + if (result.context.length === 0) { + const runNode = findChild(root, "run"); + if (runNode?.handler) { + await runNode.handler(tokens, ctx); + return; + } </file context>

cubic-dev-ai · 2026-05-11T05:23:46Z

@@ -0,0 +1,660 @@
+/**


P2: Custom agent: Exception and error message sanitization

User-facing command errors are thrown as generic Error and echoed to the REPL with raw token interpolation, rather than using sanitized typed errors.

Prompt for AI agents

Check if this issue is valid — if so, understand the root cause and fix it. At packages/evals/tui/commandTree.ts, line 242: <comment>User-facing command errors are thrown as generic `Error` and echoed to the REPL with raw token interpolation, rather than using sanitized typed errors.</comment> <file context> @@ -0,0 +1,660 @@ + if (result.node.handler) { + await result.node.handler(result.args, ctx); + } else if (result.args.length > 0) { + throw new Error( + `Unknown subcommand "${result.args[0]}" in ${pretty( + result.absolutePath, </file context>

Evals TUI tree traversal

fef0768

miguelg719 force-pushed the miguelgonzalez/stg-1880-evals-nail-tui-onboarding-experience-and-command-discovery branch from bacbe10 to fef0768 Compare May 9, 2026 00:09

miguelg719 marked this pull request as ready for review May 11, 2026 05:18

cubic-dev-ai Bot reviewed May 11, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Evals TUI tree traversal#2100

Evals TUI tree traversal#2100
miguelg719 wants to merge 1 commit intomainfrom
miguelgonzalez/stg-1880-evals-nail-tui-onboarding-experience-and-command-discovery

miguelg719 commented May 8, 2026 •

edited by cubic-dev-ai Bot

Loading

Uh oh!

changeset-bot Bot commented May 8, 2026 •

edited

Loading

Uh oh!

cubic-dev-ai Bot left a comment

Uh oh!

cubic-dev-ai Bot May 11, 2026 •

edited

Loading

Uh oh!

cubic-dev-ai Bot May 11, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

miguelg719 commented May 8, 2026 • edited by cubic-dev-ai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

why

what changed

test plan

Summary by cubic

Uh oh!

changeset-bot Bot commented May 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

⚠️ No Changeset found

Uh oh!

cubic-dev-ai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

cubic-dev-ai Bot May 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cubic-dev-ai Bot May 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

miguelg719 commented May 8, 2026 •

edited by cubic-dev-ai Bot

Loading

changeset-bot Bot commented May 8, 2026 •

edited

Loading

cubic-dev-ai Bot May 11, 2026 •

edited

Loading

cubic-dev-ai Bot May 11, 2026 •

edited

Loading