Agentbox

A manifest-driven, reproducible runtime for sovereign software agents.

One TOML manifest. One Nix flake. One runtime contract.

Quickstart · Why Agentbox · Features · Architecture · Docs · Contributing

Why Agentbox

Most agent containers fail in one of four ways:

they are mutable at boot, with npm install and curl | bash in the startup path
they hardcode one backend mesh and cannot stand alone
they expose a pile of tools but no durable runtime contract
they emit interesting data but not in a form agents or humans can actually monitor and resolve

Agentbox takes the opposite shape. A single agentbox.toml manifest drives:

the Nix package graph
the generated runtime image
the generated compose file
the generated supervisor config
the health, readiness, and observability contract

That gives you a container that can run standalone or federate into a larger system without changing codepaths. It can expose local storage, local relay, local orchestration, privacy-filtered linked-data surfaces, canonical URIs, and a browser for navigating emitted resources, while still being built as a pinned image rather than assembled live at boot.

Quickstart

Interactive onboarding

./scripts/start-agentbox.sh
./agentbox.sh up --build
./agentbox.sh health

Non-interactive source build

git clone https://github.com/DreamLab-AI/agentbox.git
cd agentbox

./scripts/agentbox-config-validate.sh
./agentbox.sh up --build
./agentbox.sh health
./agentbox.sh shell

Use the published image

export AGENTBOX_IMAGE_REF=ghcr.io/dreamlab-ai/agentbox:latest
docker pull "$AGENTBOX_IMAGE_REF"
./agentbox.sh up --registry
./agentbox.sh health

Main operator docs:

What’s In The Current Platform

Core runtime

Manifest-driven Nix build with one generated runtime path
Pluggable five-slot adapter architecture: beads, pods, memory, events, orchestrator
Local, external, or off implementations per slot
Immutable bootstrap: runtime dependencies are baked into the image, not installed on startup
Multi-arch OCI images for amd64 and arm64
Generated compose + generated supervisord + generated runtime config from the same manifest

Sovereign data stack

First-party solid-pod-rs as the primary local pod server
did:nostr:<pubkey> identity loop across relay, pod, credentials, and receipts
Embedded Nostr relay and pod mailbox bridge
Local privacy-filter sidecar with per-slot strict/soft/off policies
Linked-data surfaces for pods, events, credentials, DID docs, provenance, capability descriptors, payments, memory catalogues, architecture docs, and HTTP meta
Canonical URI grammar plus /v1/uri/<urn> resolver
JSON-LD browser slot at /lo/* for navigating emitted resources

Agent tooling

Claude, Codex, Gemini, ruflo, claude-flow, agentic-qe, nagual-qe, codebase-memory
Built-in and external MCP service support
Playwright, ComfyUI, QGIS, Blender, LaTeX, report-builder, and browser automation paths
Consultant tier for named external-model consultation workflows
Desktop mode with tiled terminal workflows when enabled

Operations and hardening

/livez, /ready, /health, Prometheus metrics, OTLP support
Hardened baseline: non-root, read_only, cap_drop: [ALL], no-new-privileges
Explicit feature exceptions instead of ambient privilege creep
Backup/restore flow for runtime state
Registry-image or local-build workflows using the same runtime contract

Features

Build and composition

Capability	Summary
Reproducible builds	Nix flake + pinned hashes + content-addressed image generation.
Manifest-gated composition	`agentbox.toml` controls what is built and what is run.
Runtime/image parity	Compose, supervisor, image contents, and probes all come from the same manifest.
Immutable bootstrap	No package-manager bootstrapping in the normal startup path.
Multi-arch publishing	Local and registry workflows support `amd64` and `arm64`.

Runtime architecture

Capability	Summary
Five-slot adapters	Durable integration seams for beads, pods, memory, events, and orchestration.
Standalone or federated	Same repo and runtime can self-host or plug into a host mesh.
Probe contract	`/livez`, `/ready`, and `/health` are first-class runtime signals.
Observability	Structured logs, Prometheus metrics, OTLP export, and runtime metadata.
Generated runtime contract	Image selection, ports, hardening, and sidecars are all derived, not hand-maintained.

Sovereign stack

Layer	Summary
Identity	`did:nostr:<pubkey>` as the primary externally visible agent identifier.
Pods	`solid-pod-rs` with Solid Protocol 0.11, WAC 2.0, rate limiting, quota, and webhook signing.
Relay	Embedded Nostr relay plus inbox/outbox bridge.
Privacy	Local `openai/privacy-filter` sidecar on adapter dispatch boundaries.
Linked data	JSON-LD 1.1 surfaces across operational and domain resources.
Canonical URIs	Stable names for emitted entities with a resolver endpoint.
Browser	Linked-data viewer that follows `@id` and renders resources by pane.

Newer linked-data and URI work

The platform now includes a real naming and browsing layer, not just emitters:

Canonical URIs: two shapes, did:nostr:<pubkey> for identity and urn:agentbox:<kind>:[<scope>:]<local> for everything else
URI resolver: /v1/uri/<urn> maps resolvable names to current representations
Linked-data browser: /lo/* serves a JSON-LD-aware browser and agentbox-specific panes
Linked-data surfaces: the external grammar spans pods, Nostr envelopes, VCs, DID docs, provenance, WoT, payments, DCAT, and docs metadata

This matters because the runtime now does more than expose APIs. It exposes a coherent namespace that agents and humans can inspect, dereference, and reason over.

Agent and MCP layer

Capability	Summary
Consultant tier	Named consultant MCPs for Codex, Gemini, Z.AI, Perplexity, and DeepSeek.
MCP support	Local MCP servers and external MCP-facing capabilities.
Browser automation	Playwright and agent-browser paths.
Media and spatial tooling	ComfyUI, ImageMagick, FFmpeg, Blender, QGIS, and 3DGS support paths.
QE/orchestration tooling	ruflo, claude-flow, agentic-qe, nagual-qe, and codebase-memory integrations.

Security and operations

Capability	Summary
Hardened baseline	Default container privileges are minimal and explicit.
Feature exception model	Security deltas are declared in the manifest rather than silently accumulated.
Secret hygiene	Management keys and sovereign identity material are not shipped as default literals.
Backup and restore	Runtime state can be exported and restored using the project tooling.
Remote operations	Provisioning and remote operation helpers exist for OCI, Fly, Hetzner, and bare targets.

Architecture

flowchart TB
    subgraph manifest["Manifest Contract"]
        M[agentbox.toml]
        V[validator]
    end

    subgraph build["Build"]
        F[flake.nix]
        I[OCI image]
        C[generated compose]
        S[generated supervisord]
    end

    subgraph runtime["Runtime"]
        API[management-api]
        AD[adapter resolver]
        POD[solid-pod-rs]
        RELAY[nostr relay]
        PF[privacy filter]
        LD[linked-data encoder]
        URI[/v1/uri resolver]
        VIEW[/lo browser]
    end

    M --> V
    M --> F
    F --> I
    F --> C
    F --> S
    I --> API
    S --> API
    API --> AD
    AD --> POD
    AD --> RELAY
    API --> PF
    API --> LD
    LD --> URI
    URI --> VIEW

Three rules matter more than anything else:

The manifest is the contract.
Adapters are the integration boundary.
Startup should realize the manifest, not invent new state.

Deeper reading:

Example flows

Sovereign local agent box

Generate a profile and build the image
Start the local pod server, relay, and management API
Emit JSON-LD surfaces for the resources you care about
Resolve did:nostr:<pubkey> and urn:agentbox:* identifiers through the management API
Browse those resources in /lo/*

Runtime as a host-mesh client

Set adapters to external implementations
Keep the same operator and probe surface locally
Route durable-state operations to the host environment
Still use the same URI and linked-data model for visibility

Build as a capability image

Enable the toolchains and skills you want in agentbox.toml
Validate
Build once with Nix
Run the same image locally, in CI, or via a registry ref

Platforms

Target	Build	Run	Notes
Linux x86_64	Native	Native	Full support, including the richest local feature set
Linux aarch64	Native	Native	Supported, subject to feature-specific gates
macOS	Compose/dev tooling	Docker Desktop/OrbStack/Colima	Usually CPU or remote-GPU paths
Windows	Compose/dev tooling	Docker Desktop + WSL2	WSL2 is the practical path
Remote Linux	Native or registry	Native	OCI/Fly/Hetzner/bare workflows supported

See:

Documentation

Operators

Sovereign stack and linked data

Developers

Canonical specs

Contributing

Start here:

Read docs/developer/architecture.md.
Validate the manifest before changing build/runtime behavior.
Prefer manifest-gated additions over ad hoc runtime mutation.
Treat hardening, probe semantics, URI grammar, and linked-data surfaces as architectural changes, not incidental code tweaks.

For substantial behavior changes, the repo already uses ADR/PRD/DDD documents as the source of truth. Follow that pattern.

License

Core project: MPL-2.0.

Some optional integrated components carry their own licenses. The linked-data browser slot, for example, uses linkedobjects/browser under AGPL-3.0 when enabled. See the relevant docs and component files for details.

Documentation · Issues · Releases · Container Registry

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Agentbox

A manifest-driven, reproducible runtime for sovereign software agents.

Why Agentbox

Quickstart

Interactive onboarding

Non-interactive source build

Use the published image

What’s In The Current Platform

Core runtime

Sovereign data stack

Agent tooling

Operations and hardening

Features

Build and composition

Runtime architecture

Sovereign stack

Newer linked-data and URI work

Agent and MCP layer

Security and operations

Architecture

Example flows

Sovereign local agent box

Runtime as a host-mesh client

Build as a capability image

Platforms

Documentation

Operators

Sovereign stack and linked data

Developers

Canonical specs

Contributing

License

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

Agentbox

A manifest-driven, reproducible runtime for sovereign software agents.

Why Agentbox

Quickstart

Interactive onboarding

Non-interactive source build

Use the published image

What’s In The Current Platform

Core runtime

Sovereign data stack

Agent tooling

Operations and hardening

Features

Build and composition

Runtime architecture

Sovereign stack

Newer linked-data and URI work

Agent and MCP layer

Security and operations

Architecture

Example flows

Sovereign local agent box

Runtime as a host-mesh client

Build as a capability image

Platforms

Documentation

Operators

Sovereign stack and linked data

Developers

Canonical specs

Contributing

License