Intelligent delegation for Claude Code. Save tokens by delegating work to other models. Keep strategy, design, and review for deep-thinking models; offload long-context analysis, bulk work, and summarization to inexpensive external LLM services with quota- and usage-aware tooling.
From an AI coding assistant's perspective:
Conjure turns delegation into a first-class workflow. I stay focused on architecture and code review while Conjure routes heavy lifting to the right external model. The unified executor, quota tracking, and skill hooks mean I can suggest Gemini for million-token reads, switch to Qwen for sandboxed execution, and still keep the "why" in Claude's head.
Concrete impact:
- Faster turn-around on large, repetitive tasks
- Safer delegation with quota warnings before rate limits bite
- Consistent handoffs via shared skills and hooks
- Better continuity: usage logs and status reports keep teams aligned
— Claude Code, November 24, 2025
The plugin wires in skills, hooks, and delegation tooling automatically.
/plugin install athola@claude-night-market
/status
uv sync # install deps (or: make deps)
make install-hooks # pre-commit hooks
make test # lint + type + security checks
Requirements: Python 3.10+, uv, and tiktoken for accurate token estimates:
uv add tiktoken
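As a rough illustration of why tiktoken matters here, the sketch below estimates a token count with tiktoken when it is available and falls back to a character-based heuristic otherwise. The function name `estimate_tokens`, the `cl100k_base` encoding choice, and the four-characters-per-token fallback are assumptions for illustration, not the executor's actual logic; the encoding is OpenAI's, so counts for Gemini or Qwen are approximate at best.

```python
def estimate_tokens(text: str, encoding_name: str = "cl100k_base") -> int:
    """Estimate a token count (hypothetical helper, not Conjure's own code)."""
    try:
        import tiktoken  # optional dependency

        encoding = tiktoken.get_encoding(encoding_name)
        return len(encoding.encode(text))
    except Exception:
        # Fallback when tiktoken is missing or its encoding data cannot load:
        # assume roughly four characters per token for English text.
        if not text:
            return 0
        return max(1, len(text) // 4)


if __name__ == "__main__":
    print(estimate_tokens("Summarize the modules under src/ in three bullets."))
```

The fallback keeps estimates available without the dependency, at the cost of accuracy on code-heavy or non-English input.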
# Check delegation readiness (auth + CLI availability)
make delegate-verify
# Auto-pick best service for a task
make delegate-auto PROMPT="Summarize src" FILES="src/"
# Monitor limits and usage
make quota-status
make usage-report
# List services
uv run python tools/delegation_executor.py --list-services
# Verify a service
uv run python tools/delegation_executor.py --verify gemini
# Auto-select based on requirements
uv run python tools/delegation_executor.py auto "Analyze this code" \
--files src/ --requirement large_context
# Force a specific service
uv run python tools/delegation_executor.py gemini "Summarize" \
--files docs/*.md --model gemini-2.5-pro-exp
# Development
make format # ruff format + check --fix
make lint # ruff check
make typecheck # mypy + ty
make security-check # bandit
make test # lint + type + security bundle
make validate-all # full validation including hooks
make clean # remove caches/venv
# Delegation lifecycle
make delegate-status
make delegate-verify
make delegate-usage
make delegate-test
make delegate-gemini PROMPT="Analyze" FILES="src/main.py"
make delegate-qwen PROMPT="Extract" FILES="src/**/*.py"
make delegate-auto PROMPT="Best service" FILES="src/"
# Quota & usage
make quota-status
make usage-report
# Quota tracker (Gemini)
uv run python tools/quota_tracker.py --status
uv run python tools/quota_tracker.py --estimate src/ docs/
uv run python tools/quota_tracker.py --validate-config
# Usage logger (Gemini)
uv run python tools/usage_logger.py --report
uv run python tools/usage_logger.py --validate
uv run python tools/usage_logger.py --status
Use skills directly in chat:
Skill(conjure:delegation-core)
Skill(conjure:gemini-delegation)
Skill(conjure:qwen-delegation)
Hooks (bridge.on_tool_start, bridge.after_tool_use) surface delegation suggestions when tasks grow large or noisy.
Auto-selects the best external service based on requirements (e.g., large context vs. sandbox execution).
Force a specific service with optional file globs and model hints.
Shows current Gemini quota usage with warnings for approaching per-minute or daily limits.
Summarizes recent Gemini requests, token counts, and success rate from ~/.claude/hooks/gemini/logs/usage.jsonl.
Checks configuration integrity for delegation limits and paths.
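The usage summary described above (requests, token counts, and success rate from a JSONL log) could be computed along these lines. This is a minimal sketch: the record fields `tokens` and `success` and the function name `summarize_usage` are assumptions, since the real schema of usage.jsonl is not shown here.

```python
import json
from typing import Iterable


def summarize_usage(lines: Iterable[str]) -> dict:
    """Aggregate JSONL usage records (field names are assumed, not confirmed)."""
    requests = 0
    successes = 0
    tokens = 0
    for line in lines:
        line = line.strip()
        if not line:
            continue
        try:
            record = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip malformed lines instead of failing the whole report
        if not isinstance(record, dict):
            continue
        requests += 1
        tokens += int(record.get("tokens", 0))
        if record.get("success"):
            successes += 1
    success_rate = successes / requests if requests else 0.0
    return {"requests": requests, "tokens": tokens, "success_rate": success_rate}


if __name__ == "__main__":
    sample = [
        '{"tokens": 100, "success": true}',
        '{"tokens": 50, "success": false}',
    ]
    print(summarize_usage(sample))
```

Because the function accepts any iterable of lines, an open file handle for the log can be passed to it directly.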
- Claude Code plugin – Registers skills, commands, and Gemini hooks.
- Skills – delegation-core, gemini-delegation, qwen-delegation for assessment and execution paths.
- Delegation executor – tools/delegation_executor.py provides unified command construction, verification, and execution with token estimation.
- Quota tracker – tools/quota_tracker.py monitors rate/daily limits with warnings.
- Usage logger – tools/usage_logger.py records requests, tokens, success, and duration with session rollups.
- Hooks – hooks/gemini/bridge.* recommend delegation when tool output size or volume suggests it.
- Makefile – Single entry point for dev, validation, and delegation workflows.
- Assess – delegation-core evaluates if a task should delegate based on size, repetition, or sandbox needs.
- Select – delegate-auto picks Gemini (large context) or Qwen (sandbox/CLI) using service metadata and requirements.
- Execute – delegation_executor builds and runs service-specific commands, capturing stdout/stderr, timing, and estimated tokens.
- Monitor – quota_tracker warns on rate/daily limits; usage_logger logs outcomes for reports.
- Integrate – Results return to Claude for review, edits, and next actions.
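The Select step above can be pictured as a lookup from a requirement to a service whose metadata advertises it. The sketch below is illustrative only: the `Service` class, the `SERVICES` table, and the `sandbox` requirement name are hypothetical, while `large_context` is the requirement value used in the executor examples earlier.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Service:
    """Hypothetical service metadata: a name plus the requirements it meets."""
    name: str
    capabilities: frozenset


# Assumed capability table mirroring the description: Gemini for large
# context, Qwen for sandboxed execution.
SERVICES = (
    Service("gemini", frozenset({"large_context"})),
    Service("qwen", frozenset({"sandbox"})),
)


def select_service(requirement: str, services=SERVICES) -> str:
    """Return the first service that advertises the requested capability."""
    for service in services:
        if requirement in service.capabilities:
            return service.name
    raise ValueError(f"no service satisfies requirement {requirement!r}")


if __name__ == "__main__":
    print(select_service("large_context"))
```

Raising on an unknown requirement, rather than silently defaulting to one service, keeps a mistyped requirement from sending work to the wrong model.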
- Delegation config overrides: ~/.claude/hooks/delegation/config.json
- Quota data: ~/.claude/hooks/gemini/usage.json
- Usage logs: ~/.claude/hooks/gemini/logs/usage.jsonl
- Make targets reference uv for dependency management; adjust limits via DEFAULT_LIMITS in tools/quota_tracker.py.
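To make the idea of adjustable limits concrete, here is a minimal sketch of a limit table and a warning check. The keys, the numeric values, and the 80% warning threshold are all assumptions chosen for illustration; the actual structure of DEFAULT_LIMITS in tools/quota_tracker.py may differ.

```python
# Hypothetical limits; these are NOT the values shipped in quota_tracker.py.
EXAMPLE_LIMITS = {"requests_per_minute": 60, "requests_per_day": 1000}
WARN_FRACTION = 0.8  # assumed: warn once usage reaches 80% of a limit


def quota_warnings(usage: dict, limits: dict = EXAMPLE_LIMITS,
                   warn_fraction: float = WARN_FRACTION) -> list:
    """Return one human-readable warning per limit that is near or at its cap."""
    warnings = []
    for key, limit in limits.items():
        used = usage.get(key, 0)
        if used >= limit:
            warnings.append(f"{key}: limit reached ({used}/{limit})")
        elif used >= warn_fraction * limit:
            warnings.append(f"{key}: approaching limit ({used}/{limit})")
    return warnings


if __name__ == "__main__":
    print(quota_warnings({"requests_per_minute": 50, "requests_per_day": 10}))
```

Keeping the limits in a plain dictionary means a config override only needs to supply different numbers, not different code.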
uv sync
make lint typecheck security-check
make test
See CHANGELOG.md for release notes (current: 1.1.0) and LICENSE (MIT).
MIT