feat(tools): task tool set by VascoSch92 · Pull Request #2143 · OpenHands/software-agent-sdk

VascoSch92 · 2026-02-20T11:47:55Z

Summary

Adding the TaskToolSet tool, but without the run_in_background feature, as discussed in #2100.

The arguments currently missing from this API (compared to the Claude Code task tool) are:

run_in_background
model

A difference between this version of TaskToolSet and DelegateTool is the ability to resume a task. The task_tool_set example demonstrates this mechanism.
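
For readers who have not opened the example, the resume idea can be pictured roughly as below. This is a minimal in-memory sketch and not the code from this PR: `ToyTask`, `ToyTaskManager`, `run`, and the `resume` parameter are hypothetical names chosen for illustration.

```python
import uuid
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class ToyTask:
    """Hypothetical record of one delegated task and its prompt history."""

    task_id: str
    history: list = field(default_factory=list)


class ToyTaskManager:
    """Hypothetical manager: keeps tasks by id so a later call can continue one."""

    def __init__(self) -> None:
        self._tasks = {}

    def run(self, prompt: str, resume: Optional[str] = None) -> ToyTask:
        if resume is None:
            # Fresh task: new id, empty history.
            task = ToyTask(task_id=uuid.uuid4().hex[:8])
            self._tasks[task.task_id] = task
        else:
            # Resumed task: continue from the earlier history.
            if resume not in self._tasks:
                raise KeyError(f"unknown task id: {resume}")
            task = self._tasks[resume]
        task.history.append(prompt)
        return task


manager = ToyTaskManager()
first = manager.run("Summarize the README")
second = manager.run("Now shorten the summary", resume=first.task_id)
```

Here `second` is the same task object as `first`, so the follow-up prompt sees the earlier context instead of starting from scratch.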

Note
The tool description has been updated and tailored to reflect the current functionality (since background tasks are not supported).

(ref issue #2057)

Docs
OpenHands/docs#352

Checklist

If the PR is changing/adding functionality, are there tests to reflect this?
If there is an example, have you run the example to make sure that it works?
If there are instructions on how to run the code, have you followed the instructions and made sure that it works?
If the feature is significant enough to require documentation, is there a PR open on the OpenHands/docs repository with the same branch name?
Is the GitHub CI passing?

Agent Server images for this PR

• GHCR package: https://github.com/OpenHands/agent-sdk/pkgs/container/agent-server

Variants & Base Images

Variant	Architectures	Base Image	Docs / Tags
java	amd64, arm64	`eclipse-temurin:17-jdk`	Link
python	amd64, arm64	`nikolaik/python-nodejs:python3.12-nodejs22`	Link
golang	amd64, arm64	`golang:1.21-bookworm`	Link

Pull (multi-arch manifest)

# Each variant is a multi-arch manifest supporting both amd64 and arm64
docker pull ghcr.io/openhands/agent-server:a866cf8-python

Run

docker run -it --rm \
  -p 8000:8000 \
  --name agent-server-a866cf8-python \
  ghcr.io/openhands/agent-server:a866cf8-python

All tags pushed for this build

ghcr.io/openhands/agent-server:a866cf8-golang-amd64
ghcr.io/openhands/agent-server:a866cf8-golang_tag_1.21-bookworm-amd64
ghcr.io/openhands/agent-server:a866cf8-golang-arm64
ghcr.io/openhands/agent-server:a866cf8-golang_tag_1.21-bookworm-arm64
ghcr.io/openhands/agent-server:a866cf8-java-amd64
ghcr.io/openhands/agent-server:a866cf8-eclipse-temurin_tag_17-jdk-amd64
ghcr.io/openhands/agent-server:a866cf8-java-arm64
ghcr.io/openhands/agent-server:a866cf8-eclipse-temurin_tag_17-jdk-arm64
ghcr.io/openhands/agent-server:a866cf8-python-amd64
ghcr.io/openhands/agent-server:a866cf8-nikolaik_s_python-nodejs_tag_python3.12-nodejs22-amd64
ghcr.io/openhands/agent-server:a866cf8-python-arm64
ghcr.io/openhands/agent-server:a866cf8-nikolaik_s_python-nodejs_tag_python3.12-nodejs22-arm64
ghcr.io/openhands/agent-server:a866cf8-golang
ghcr.io/openhands/agent-server:a866cf8-java
ghcr.io/openhands/agent-server:a866cf8-python

About Multi-Architecture Support

Each variant tag (e.g., a866cf8-python) is a multi-arch manifest supporting both amd64 and arm64
Docker automatically pulls the correct architecture for your platform
Individual architecture tags (e.g., a866cf8-python-amd64) are also available if needed

github-actions · 2026-02-20T11:54:54Z

Coverage Report

File	Stmts	Miss	Cover	Missing
openhands-tools/openhands/tools/task
definition.py	56	26	53%	66, 72–74, 76–78, 80–81, 89, 94–95, 98, 100, 145, 193–194, 196–198, 200, 204–205, 207–208, 214
manager.py	116	73	37%	64–66, 70–72, 79, 81, 84, 87–88, 92–93, 97, 101–104, 107–109, 111, 135–136, 138–139, 144, 150, 157–158, 163–165, 173, 180, 189–191, 199, 205, 215–216, 218–221, 223, 239–240, 242, 244–245, 247, 251–252, 254–257, 259–267, 269, 271, 275–276, 278
TOTAL	18933	9623	49%

all-hands-bot

🟡 Acceptable - Feature works and solves a real problem (task resumption). Some data structure complexity and naming could be cleaner, but nothing blocking.

Key Insight: The state management spreads task tracking across multiple data structures when TaskState already has status - consolidating this would eliminate special case tracking.
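
One way to read this suggestion, as a sketch only: if every task record carries its own status, then "running" or "finished" collections become derived views rather than separate structures to keep in sync. The names `TaskStatus`, `ToyTaskState`, and `ToyStore` and the status values are assumptions; the actual fields in manager.py are not shown in this thread.

```python
from dataclasses import dataclass
from enum import Enum


class TaskStatus(Enum):
    """Assumed status values, for illustration only."""

    RUNNING = "running"
    COMPLETED = "completed"
    ERROR = "error"


@dataclass
class ToyTaskState:
    task_id: str
    status: TaskStatus = TaskStatus.RUNNING


class ToyStore:
    """Single source of truth: one dict, status lives on the record."""

    def __init__(self) -> None:
        self._tasks = {}

    def add(self, task_id: str) -> None:
        self._tasks[task_id] = ToyTaskState(task_id)

    def set_status(self, task_id: str, status: TaskStatus) -> None:
        self._tasks[task_id].status = status

    def tasks_with(self, status: TaskStatus) -> list:
        # Derived view: no second collection to update on every transition.
        return [t.task_id for t in self._tasks.values() if t.status is status]


store = ToyStore()
store.add("a")
store.add("b")
store.set_status("a", TaskStatus.COMPLETED)
```

With this shape, a status transition is a single assignment and there is no special-case bookkeeping to forget.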

openhands-tools/openhands/tools/task/manager.py

openhands-tools/openhands/tools/task/impl.py

enyst · 2026-02-21T01:24:17Z

@OpenHands Do a /codereview-roasted on this PR. Take a look at some other tool too to see if we follow the patterns in the codebase.

Post your review and your answer as a comment to this PR. Note that it will be rendered as markdown.

Once you are done, take a step back and look at your review and diff with fresh eyes: section by section, was it more minor or major than it seemed to be? Reanalyze. When you're done, post your reanalysis as a follow-up comment.

openhands-ai · 2026-02-21T01:24:38Z

I'm on it! enyst can track my progress at all-hands.dev

enyst · 2026-02-21T01:28:34Z

Reanalysis (fresh eyes)

Stepping back: I probably came in hot on a couple of things that are more “taste” than “stop-ship”. Here’s how I’d re-rank it after a second pass.

Still major / correctness

Registering TaskTool in the global registry is a real runtime bug. This isn’t style — it’s an API contract violation with register_tool(...). If someone tries Tool(name="task"), it will fail tool resolution immediately. That’s genuinely merge-blocking.
TaskObservation.visualize error branch returning super().visualize also looks like plain wrong behavior (dead code / unexpected output). Not as catastrophic as the registry issue, but it’s still a correctness fix, not bikeshedding.
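
To make the registry point concrete without quoting the SDK: the exact contract of register_tool(...) is not shown in this thread, so the toy below only illustrates the general failure mode, where registration succeeds but resolution breaks later. `toy_register`, `toy_resolve`, and the zero-argument factory contract are all assumptions.

```python
# Hypothetical registry: every registered factory must be callable with no
# arguments. This contract is assumed for illustration only.
_REGISTRY = {}


def toy_register(name, factory):
    _REGISTRY[name] = factory  # nothing is validated at registration time


def toy_resolve(name):
    return _REGISTRY[name]()  # the contract is only exercised here


class ToyManager:
    pass


class ToyTaskTool:
    """Needs a manager, so a bare zero-argument call cannot build it."""

    def __init__(self, manager):
        self.manager = manager


# Registering the class directly succeeds silently...
toy_register("task", ToyTaskTool)

try:
    toy_resolve("task")
    resolved = True
except TypeError:
    # ...but resolution fails because the required argument is missing.
    resolved = False

# Registering a factory that supplies the dependency satisfies the contract.
toy_register("task", lambda: ToyTaskTool(ToyManager()))
tool = toy_resolve("task")
```

The point is only that this kind of mismatch surfaces at lookup time, which is why it is easy to miss in review and in tests that never resolve the tool by name.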

Probably not worth blocking (but worth considering)

The big inline TASK_TOOL_DESCRIPTION + # noqa: E501 is ugly, but it’s not inherently wrong. Switching to a template is a quality improvement, not a functional requirement.
Using tempfile.mkdtemp(...) for the resumable persistence dir is potentially inconsistent with other persistence paths, but in practice it might be totally acceptable for the initial iteration (especially if this is currently positioned as a standalone/experimental tool).
The _visualizer / _name access is a layering smell, but it’s also optional functionality (sender label) and could be tightened later if needed.
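
On the tempfile.mkdtemp(...) point, a small sketch of how the directory choice could be made configurable later. `pick_persistence_dir` and its parameters are hypothetical; only `tempfile.mkdtemp` and `pathlib.Path` are real standard-library calls.

```python
import tempfile
from pathlib import Path
from typing import Optional


def pick_persistence_dir(base_dir: Optional[Path] = None, prefix: str = "task_") -> Path:
    """Return a fresh directory for task state (hypothetical helper).

    With base_dir=None the directory is created under the system temp
    location; with a base_dir it lives next to other persisted data.
    """
    if base_dir is not None:
        base_dir.mkdir(parents=True, exist_ok=True)
    # mkdtemp creates the directory and returns its path as a str.
    return Path(tempfile.mkdtemp(prefix=prefix, dir=base_dir))


scratch = pick_persistence_dir()
```

Defaulting to the temp location keeps the first iteration simple, while the optional `base_dir` leaves room to align with other persistence paths if that turns out to matter.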

Net assessment

The core idea (ToolSet factory that creates a task tool backed by a shared manager, with resume support) is sound and matches existing patterns (BrowserToolSet). The PR is close — it just needs that registry fix (and ideally the visualize bug fix) to avoid shipping a broken API surface.
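
A rough picture of that pattern, as a sketch under assumptions: the tool set is a factory, and the tool it returns holds a reference to a manager created alongside it. `ToyManager`, `ToyTool`, and `ToyToolSet.create` are hypothetical names and do not mirror the SDK's real signatures.

```python
from dataclasses import dataclass, field


@dataclass
class ToyManager:
    """Hypothetical shared state owner; records calls to make sharing visible."""

    calls: list = field(default_factory=list)


@dataclass
class ToyTool:
    name: str
    manager: ToyManager

    def __call__(self, prompt: str) -> int:
        # Every invocation goes through the same manager instance.
        self.manager.calls.append(prompt)
        return len(self.manager.calls)


class ToyToolSet:
    """Hypothetical tool set: a factory rather than a tool in its own right."""

    @classmethod
    def create(cls) -> list:
        manager = ToyManager()  # one manager per create() call
        return [ToyTool(name="task", manager=manager)]


tools = ToyToolSet.create()
task_tool = tools[0]
task_tool("first")
count = task_tool("second")
```

Because state lives in the manager and not in the tool call, a second invocation can see what the first one did, which is the property resume support depends on.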

github-actions · 2026-02-23T07:35:44Z

API breakage checks (Griffe)

Result: Passed

all-hands-bot

Taste Rating: 🟡 Acceptable - Feature works and solves a real problem (task resumption). Some data structure complexity and naming could be cleaner, but nothing blocking.

[IMPROVEMENT OPPORTUNITIES]

The previous review flagged an unresolved issue that's still present in manager.py (see inline comment).

VERDICT:
✅ Worth merging - Adds useful task resumption capability with comprehensive tests. The defensive programming pattern is minor and doesn't affect correctness.

KEY INSIGHT:
Task resumption via conversation persistence is a solid feature. The state management is straightforward - the only complexity comes from defensive checks against known types.

openhands-tools/openhands/tools/task/manager.py

enyst · 2026-02-23T14:47:50Z

Thank you! LGTM

@simonrosenberg I'd love to know your thoughts on this PR

examples/01_standalone_sdk/41_task_tool_set.py

openhands-tools/openhands/tools/task/__init__.py

openhands-tools/openhands/tools/task/definition.py

openhands-tools/openhands/tools/task/manager.py

Co-authored-by: simonrosenberg <157206163+simonrosenberg@users.noreply.github.com>

VascoSch92 · 2026-02-25T09:02:21Z

Merging even though the [Optional] Docs example /check-examples check is not passing.

This is because of another example in the codebase that currently has no documentation.

The example added in this PR has docs.

VascoSch92 requested a review from all-hands-bot February 20, 2026 11:48

VascoSch92 requested a review from simonrosenberg February 20, 2026 14:03

VascoSch92 marked this pull request as ready for review February 20, 2026 14:04

enyst added the review-this label (triggers a PR review by OpenHands) Feb 20, 2026

VascoSch92 requested a review from enyst February 20, 2026 23:35

enyst added and removed the review-this label Feb 21, 2026

all-hands-bot approved these changes Feb 21, 2026

VascoSch92 added 3 commits February 23, 2026 08:34

task tool set

a9f215e

update

d81b76c

completed instead of successfull

10054bc

VascoSch92 force-pushed the vasco/task-tool-set branch from 8041249 to 10054bc Compare February 23, 2026 07:35

add tests

16c26d3

VascoSch92 requested a review from all-hands-bot February 23, 2026 08:29

re-number example

806e704

all-hands-bot approved these changes Feb 23, 2026

openhands-tools/openhands/tools/task/manager.py (review thread resolved)

VascoSch92 mentioned this pull request Feb 23, 2026

feat(tools): Claude Delegation Tools #2100

Closed

7 tasks