TraceOps

HDFS log analysis agent. Retrieves relevant log chunks, generates an answer, scores confidence across three signals, then decides whether to suggest, flag for review, or abstain.

retrieve → reason → confidence → policy → suggest / review / abstain

No tool-picking loops. The LLM answers; the pipeline decides if that answer is trustworthy.
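The fixed sequence above can be sketched as one straight-line function. This is a minimal illustration, not the code in src/: `run_pipeline`, `Result`, and the four stage callables are hypothetical names, and the stub stages only stand in for the real retriever, LLM, and scorer.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Result:
    answer: str
    confidence: float
    action: str  # "suggest", "review", or "abstain"


def run_pipeline(
    query: str,
    retrieve: Callable[[str], List[str]],
    reason: Callable[[str, List[str]], str],
    score: Callable[[str, List[str]], float],
    decide: Callable[[float], str],
) -> Result:
    """Run retrieve -> reason -> confidence -> policy exactly once, in order."""
    chunks = retrieve(query)            # retrieve: fetch relevant log chunks
    answer = reason(query, chunks)      # reason: the LLM drafts an answer
    confidence = score(answer, chunks)  # confidence: score the draft
    action = decide(confidence)         # policy: the pipeline, not the LLM, decides
    return Result(answer, confidence, action)


# Stub stages with placeholder text, for illustration only.
demo = run_pipeline(
    "Why is blk_-1233456789 failing to replicate?",
    retrieve=lambda q: ["placeholder log chunk mentioning blk_-1233456789"],
    reason=lambda q, chunks: "placeholder answer about blk_-1233456789",
    score=lambda answer, chunks: 0.8,
    decide=lambda c: "suggest" if c >= 0.75 else "review" if c >= 0.40 else "abstain",
)
print(demo.action)
```

There is no branch where the model chooses the next step, which is the point of "no tool-picking loops": each stage runs once and the final action depends only on the score.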

Setup

pip install -r requirements.txt

Needs Ollama running locally with a model pulled (ollama pull mistral). For other providers set the relevant env var (GOOGLE_API_KEY, OPENAI_API_KEY, etc.) and pass --llm-provider.

Usage

# download dataset and run
python main.py --dataset HDFS_v1 --download

# single query
python main.py --demo --query "Why is blk_-1233456789 failing to replicate?"

# different LLM
python main.py --demo --llm-provider google --llm-model gemini-1.5-flash

# tail a live log
python main.py --live data/HDFS.log --live-mode interactive

# eval only
python main.py --eval-only --n-eval-queries 200

Confidence

Three signals, weighted sum:

Signal	Default weight
Grounding (token overlap with retrieved context)	0.40
Self-consistency (cosine sim across two generations)	0.35
Citation density	0.25
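As a rough illustration of the grounding signal, token overlap can be computed as the fraction of distinct answer tokens that also occur in the retrieved chunks. The tokenizer and the exact ratio below are assumptions; the project's own scorer may normalize or weight tokens differently.

```python
import re
from typing import List, Set


def tokenize(text: str) -> List[str]:
    """Lowercase and split into word-like tokens; keeps IDs such as blk_-123 whole."""
    return re.findall(r"[a-z0-9_-]+", text.lower())


def grounding_score(answer: str, chunks: List[str]) -> float:
    """Fraction of distinct answer tokens that appear anywhere in the context."""
    answer_tokens = set(tokenize(answer))
    if not answer_tokens:
        return 0.0  # an empty answer has nothing to ground
    context_tokens: Set[str] = set()
    for chunk in chunks:
        context_tokens.update(tokenize(chunk))
    return len(answer_tokens & context_tokens) / len(answer_tokens)
```

A score of 1.0 means every answer token was seen in the context; it says nothing about whether the tokens are used correctly, which is why it is only one of three signals.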

Thresholds: suggest ≥ 0.75, review ≥ 0.40, abstain below that. Both thresholds are configurable in config.py.
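A sketch of how the weighted sum and the thresholds could fit together, using the default values listed above. `ConfidenceConfig`, `combine`, and `decide` are hypothetical names; the actual layout of config.py is not shown here, and each signal is assumed to lie in [0, 1].

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfidenceConfig:
    # Default weights and thresholds as documented above.
    w_grounding: float = 0.40
    w_consistency: float = 0.35
    w_citation: float = 0.25
    suggest_threshold: float = 0.75
    review_threshold: float = 0.40


DEFAULTS = ConfidenceConfig()


def combine(grounding: float, consistency: float, citation: float,
            cfg: ConfidenceConfig = DEFAULTS) -> float:
    """Weighted sum of the three signals."""
    return (cfg.w_grounding * grounding
            + cfg.w_consistency * consistency
            + cfg.w_citation * citation)


def decide(confidence: float, cfg: ConfidenceConfig = DEFAULTS) -> str:
    """Map a confidence score to suggest / review / abstain."""
    if confidence >= cfg.suggest_threshold:
        return "suggest"
    if confidence >= cfg.review_threshold:
        return "review"
    return "abstain"


# Worked example: 0.40*0.9 + 0.35*0.8 + 0.25*0.5 = 0.36 + 0.28 + 0.125 = 0.765
score = combine(grounding=0.9, consistency=0.8, citation=0.5)
print(round(score, 3), decide(score))
```

Because the default weights sum to 1.0, the combined score stays in [0, 1] whenever the individual signals do, so the thresholds can be read directly against it.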

MCP

Runs as an MCP server over stdio. Any MCP client (Claude Desktop, Cursor, etc.) can call retrieve_logs, run_agent_query, score_confidence, knowledge_graph_context, list_anomalies.

pip install "mcp>=1.0.0"
python -m src.mcp_server.server

mcp.json at the repo root has ready-to-paste configs for both local venv and Docker.
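For orientation, the snippet below builds a client entry in the `mcpServers` shape that Claude Desktop and similar clients read. It is only a sketch: the contents of the shipped mcp.json are not reproduced here, and the server name `traceops` and the helper `mcp_server_entry` are assumptions. Prefer the configs in mcp.json when they apply.

```python
import json


def mcp_server_entry(python_executable: str = "python") -> dict:
    """Build a stdio server entry that launches the TraceOps MCP module."""
    return {
        "mcpServers": {
            "traceops": {  # hypothetical server name
                "command": python_executable,
                "args": ["-m", "src.mcp_server.server"],
            }
        }
    }


print(json.dumps(mcp_server_entry(), indent=2))
```

The `-m src.mcp_server.server` module path resolves only when the process starts from the repo root, so a venv setup would typically pass the venv's Python path as `python_executable` and launch from that directory.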

Docker

docker compose up -d
docker compose exec ollama ollama pull mistral
docker compose run --rm traceops python main.py --demo

Copy .env.example to .env and fill in your provider/model.

Tests

python -m pytest tests/ -v

CI runs on every PR (Python 3.11 + 3.12, flake8, pytest --cov, Docker smoke test).

Data

LogHub HDFS dataset — put HDFS.log and anomaly_label.csv in data/, or use --download. Falls back to synthetic logs if neither is present.
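The fallback order described above might look like the following. This is a guess at the logic, not the loader in src/: `pick_data_source` is a hypothetical helper, and it assumes local files take precedence over --download.

```python
from pathlib import Path
from typing import Union


def pick_data_source(data_dir: Union[str, Path], download: bool = False) -> str:
    """Return "local", "download", or "synthetic" for the given data directory."""
    data_dir = Path(data_dir)
    have_files = (
        (data_dir / "HDFS.log").is_file()
        and (data_dir / "anomaly_label.csv").is_file()
    )
    if have_files:
        return "local"      # both LogHub files are already in place
    if download:
        return "download"   # --download was passed
    return "synthetic"      # nothing available: generate synthetic logs


print(pick_data_source("data"))
```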
