OpenCAS Usage Guide

Recommended Operator Path

The current repo is best operated through the dashboard and API server:

source .venv/bin/activate
python -m opencas --with-server --accept-bootstrap-responsibility

The acknowledgement flag is required only for non-TUI fresh bootstraps. It makes the first-boot boundary explicit: OpenCAS creates persistent continuity, not a disposable chat session, and deleting the state directory deletes that agent's continuity. Later launches against an existing state directory do not require the flag.

Default address:

http://127.0.0.1:8080/dashboard

Important Accuracy Note

The current CLI does not expose a separate interactive python -m opencas chat terminal mode. Conversation happens through:

the dashboard Chat tab
POST /api/chat/send
the WebSocket bridge at /ws
Telegram, when enabled and configured

Useful CLI Flags

python -m opencas --help

High-value current flags:

--with-server
--host
--port
--state-dir
--default-llm-model
--embedding-model-id
--provider-config-path
--provider-env-path
--credential-source-config-path
--credential-source-env-path
--credential-profile-id
--credential-env-key
--telegram-enabled
--telegram-disabled
--tui
--accept-bootstrap-responsibility

Dashboard Tabs

Tab	Purpose
Overview	High-level runtime, health, token, and memory summaries
Health	Doctor status, recent health history, runtime and event visibility
Chat	Session history, current provider/model lane, somatic context, current work, voice controls, and message send
Operations	Receipts, tasks, work objects, qualification runs, approval audit, and live process control
Schedule	Durable schedule items, calendar view, and run history
Usage	Token telemetry, dominant models and sources, provider telemetry notes, and process hygiene clues
Daydream	Reflections, conflicts, keeper promotion, and daydream-origin work
Memory	Atlas, search, embedding projection, node detail, and retrieval inspection
Identity	Self-model, user-model, continuity, musubi, and somatic state
Executive	Goals, plans, commitments, and executive snapshot
Platform	Capability inventory, extension lifecycle, and plugin trust control
System	Effective config, configured model options, Telegram, and phone setup/status
Logs	Runtime telemetry event stream and filtering controls

Chat Behavior

The Chat sidebar shows the active lane: provider, resolved model, auth profile, and auth source.
New assistant messages carry their own lane metadata in history.
Older assistant messages that predate lane metadata should be treated as legacy historical entries.
The chat context panel also surfaces somatic state, current work, executive intent, and recent background-task counts.
Voice controls let the operator record a prompt, transcribe it, and synthesize spoken replies when configured.

Desktop Context / Body Double

The Body Double plugin is exposed through the desktop_context_* tools. It can take a screenshot/OCR observation, inspect current MPRIS media state, retrieve a YouTube transcript when available, and optionally transcribe the currently playing audio with local Whisper.

Relevant configuration keys:

Key	Meaning
`enabled`	Enables desktop-context observations.
`media_commentary_mode_enabled`	Allows ongoing media commentary observations after a video-commentary request.
`youtube_transcripts_enabled`	Retrieves prefetched YouTube transcript evidence when a YouTube URL is detected.
`live_transcription_enabled`	Captures short currently playing audio windows and transcribes them with local Whisper.
`live_transcription_capture_seconds`	Length of each audio window. Default: `6`.
`live_transcription_min_interval_seconds`	Minimum interval before another live audio transcription for the same media target. Default: `12`.
`live_transcription_audio_input`	Optional Pulse/PipeWire monitor source for `ffmpeg`; when empty OpenCAS tries the default sink monitor.
`live_transcription_whisper_model`	Whisper CLI model name. Default: `base`.
`livestream_resume_catchup_enabled`	After Body Double pauses a confirmed livestream to speak, resume with a catch-up playback-speed request. Default: `true`.
`livestream_resume_catchup_rate`	Catch-up playback rate. Default: `1.5`.
`livestream_resume_catchup_max_seconds`	Maximum time before restoring normal speed after catch-up starts. Default: `90`.

For media commentary, the service only uses live transcription for a currently playing media target. It skips live transcription when the video is paused, stopped, or when multiple media items are playing and the target is ambiguous.

When both prefetched and live transcripts exist, the prefetched transcript is used as the timestamped map and the live Whisper excerpt is used as the current heard segment. This is the path used for livestream following and for checking that commentary is aligned to the current moment rather than an older or future part of a transcript.

When Body Double speaks over a single confirmed livestream, it pauses the current media first and resumes only the player it paused. If the stream is marked live by media metadata or an explicit livestream title/status signal, OpenCAS asks playback to catch up at 1.5x; ordinary YouTube videos are not sped up just because MPRIS omits duration metadata. Browser YouTube speed shortcuts are only allowed after a foreground verifier confirms the focused window is the intended YouTube player. If that proof is unavailable, OpenCAS skips the shortcut instead of typing into the foreground application. If the media changes before the delayed normal-speed restore, OpenCAS skips the shortcut restore instead of acting on a stale video.

Memory And Recall

OpenCAS memory behavior is currently exposed through both runtime behavior and the Memory dashboard tab:

episodic and distilled memories
semantic retrieval using provider-backed embeddings by default
artifact-backed autobiographical memory for authored plans/stories/notes
retrieval inspection endpoints and UI
connected graph and landscape views

Daydreaming

Daydreaming is a background autonomy path, not a chat command. When idle conditions and cooldown rules allow it, OpenCAS can produce:

reflections
keeper memories
daydream-origin work objects
conflict records

These are visible through the Daydream dashboard tab and the /api/daydream/* routes.

Schedule

Use the Schedule tab when you need durable recurring work instead of a one-off chat action:

create task or reminder schedules
inspect future run windows
trigger or cancel a schedule from the dashboard
review run history for missed or repeated items

Platform

Use the Platform tab when you need to inspect or change the extension surface:

view canonical capabilities
inspect extension bundles before install or update
manage extension lifecycle state
adjust plugin trust policies and trust feed sync

Phone

Phone configuration is exposed through the System surface and the /api/phone/* routes:

owner and caller workspace separation
public base URL and webhook settings
Twilio and voice model configuration
recent call review and owner call initiation

Telegram is a configured channel in the current repo. Use the System tab or the TUI bootstrap to configure:

bot token
pairing/allowlist/open DM policy
allowlisted Telegram user ids
poll interval and pairing TTL

Usage And Cost Monitoring

The Usage tab is the operator-facing place to inspect:

token totals
dominant usage sources
dominant models
recent large usage events
provider telemetry notes when available
process hygiene context for runaway spend investigations

Logs

The Logs tab is the event stream when you need to answer “what happened?” quickly:

filter by event kind
filter by session
search by text
inspect a recent timeline without opening the raw telemetry store

Practical Workflow

Launch with python -m opencas --with-server, adding --accept-bootstrap-responsibility if this is a non-TUI fresh bootstrap.
Open the Chat tab and confirm the active provider/model lane.
Use the System tab if the effective model, Telegram, phone, or voice setup looks wrong.
Use Memory when debugging recall quality.
Use Operations when debugging task state, receipts, or background processes.
Use Schedule when debugging recurring work or missed runs.
Use Platform when debugging extension installation or trust policy issues.
Use Logs when you need the raw event sequence.
Use Usage when debugging spend or rate-limit surprises.

Stopping OpenCAS

Stop the server with Ctrl+C in the terminal running the process.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OpenCAS Usage Guide

Recommended Operator Path

Important Accuracy Note

Useful CLI Flags

Dashboard Tabs

Chat Behavior

Desktop Context / Body Double

Memory And Recall

Daydreaming

Schedule

Platform

Phone

Telegram

Usage And Cost Monitoring

Logs

Practical Workflow

Stopping OpenCAS

Related Docs

FilesExpand file tree

usage.md

Latest commit

History

usage.md

File metadata and controls

OpenCAS Usage Guide

Recommended Operator Path

Important Accuracy Note

Useful CLI Flags

Dashboard Tabs

Chat Behavior

Desktop Context / Body Double

Memory And Recall

Daydreaming

Schedule

Platform

Phone

Telegram

Usage And Cost Monitoring

Logs

Practical Workflow

Stopping OpenCAS

Related Docs