Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
37 commits
Select commit Hold shift + click to select a range
6287509
feat: make results directory output optional (#107)
pcarleton Jan 14, 2026
0675e9b
update npm audit (#108)
KKonstantinov Jan 19, 2026
4b5c835
chore: update licensing to Apache 2.0 for new contributions (#100)
domdomegg Jan 19, 2026
7a3b13a
0.1.10
pcarleton Jan 20, 2026
aeaa406
chore: add .idea/ to .gitignore (#111)
maxisbey Jan 21, 2026
cc784d9
Update warning message in README.md (#112)
Kludex Jan 22, 2026
83ff334
feat: add --expected-failures baseline and composite GitHub Action (#…
maxisbey Jan 22, 2026
d0b2b49
feat: add DNS rebinding protection conformance tests (#115)
pcarleton Jan 22, 2026
5dca74b
feat: add core and extensions suites for SDK tiering (#117)
pcarleton Jan 22, 2026
245e44e
feat: add pre-registration conformance test (#120)
pcarleton Jan 23, 2026
2d52de5
docs: add SDK integration guide and fix spec references (#121)
pcarleton Jan 23, 2026
4a2ba8a
feat: add PKCE conformance tests (#119)
pcarleton Jan 23, 2026
67ad358
feat: add resource parameter validation tests (RFC 8707) (#118)
pcarleton Jan 23, 2026
c2f3fda
0.1.11 (#122)
pcarleton Jan 23, 2026
0064ea0
Add allowClientError to Scenario interface for expected client failur…
mikekistler Jan 26, 2026
4f91d7f
ci: migrate to OIDC trusted publishing, add workflow_dispatch for pre…
pcarleton Jan 28, 2026
df87099
feat: move march back-compat auth tests to optional backcompat suite …
pcarleton Jan 29, 2026
4bfafd5
chore: 0.1.12 (#128)
pcarleton Jan 29, 2026
189a31d
Bump hono from 4.11.4 to 4.11.7 (#124)
dependabot[bot] Jan 30, 2026
22196ca
fix: bump SDK to 1.26.0, add session management to tools_call scenari…
pcarleton Feb 5, 2026
a1b8aaa
0.1.13 (#136)
pcarleton Feb 5, 2026
37225ce
fix: create fresh Server per request in auth test helpers (#141)
felixweinberger Feb 10, 2026
3b4a92b
feat: SDK tier assessment CLI and skill (#142)
felixweinberger Feb 12, 2026
c82fd65
fix: recurse into subdirectories when parsing conformance output (#148)
felixweinberger Feb 13, 2026
83c446d
feat: add conformance tests for SEP-990 (#110)
sagar-okta Feb 13, 2026
dd14862
fix: use InvalidTokenError instead of generic Error in MockTokenVerif…
jdmaturen Feb 13, 2026
3f9048a
feat: add specVersion classification to conformance scenarios (#147)
felixweinberger Feb 13, 2026
8ab0831
0.1.14 (#149)
pcarleton Feb 13, 2026
066b2d7
fix: tier-check branch support and draft/extension scoring (#153)
felixweinberger Feb 17, 2026
353ea70
feat: validate PRM resource field
wdawson Feb 17, 2026
20797ed
feat: cleanup www-authenticate checks
wdawson Feb 17, 2026
27f0bed
fix: return content-type header in 404 case for callback server
wdawson Feb 17, 2026
4f122cd
feat: add check for invalid access token
wdawson Feb 18, 2026
292c81e
feat: add CIMD tests and conform to same ID pattern
wdawson Feb 18, 2026
769e66c
feat: support for pre-registered client
wdawson Feb 19, 2026
435caea
Merge branch 'main' into server-auth-conformance
wdawson Feb 19, 2026
8ebd8dd
fix: lint
wdawson Feb 19, 2026
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
258 changes: 258 additions & 0 deletions .claude/skills/mcp-sdk-tier-audit/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,258 @@
# MCP SDK Tier Audit

Assess any MCP SDK repository against [SEP-1730](https://github.com/modelcontextprotocol/modelcontextprotocol/issues/1730) (the SDK Tiering System). Produces a tier classification (1/2/3) with an evidence-backed scorecard.

Two components work together:

- **`tier-check` CLI** — runs deterministic checks (server + client conformance pass rate, issue triage speed, P0 resolution, labels, releases, policy signals). Works standalone, no AI needed.
- **AI-assisted assessment** — an agent uses the CLI scorecard plus judgment-based evaluation (documentation coverage, dependency policy, roadmap) to produce a full tier report with remediation guide.

## Quick Start: CLI

The CLI is a subcommand of the [MCP Conformance](https://github.com/modelcontextprotocol/conformance) tool.

```bash
# Clone and build
git clone https://github.com/modelcontextprotocol/conformance.git
cd conformance
npm install
npm run build

# Authenticate with GitHub (needed for API access)
gh auth login

# Run against any MCP SDK repo (without conformance tests)
npm run --silent tier-check -- --repo modelcontextprotocol/typescript-sdk --skip-conformance
```

The CLI uses the GitHub API (read-only) for issue metrics, labels, and release checks. Authenticate via one of:

- **GitHub CLI** (recommended): `gh auth login` — the CLI picks up your token automatically
- **Environment variable**: `export GITHUB_TOKEN=ghp_...`
- **Flag**: `--token ghp_...`

For public repos, any authenticated token works (no special scopes needed — authentication just avoids rate limits). For a [fine-grained personal access token](https://github.com/settings/personal-access-tokens/new), select **Public Repositories (read-only)** with no additional permissions.

### CLI Options

```
--repo <owner/repo> GitHub repository (required)
--branch <branch> Branch to check
--skip-conformance Skip conformance tests
--conformance-server-url <url> URL of the already-running conformance server
--client-cmd <cmd> Command to run the SDK conformance client (for client conformance tests)
--days <n> Limit triage analysis to last N days
--output <format> json | markdown | terminal (default: terminal)
--token <token> GitHub token (defaults to GITHUB_TOKEN or gh auth token)
```

### What the CLI Checks

| Check | What it measures |
| ------------------ | ------------------------------------------------------------------------------ |
| Server Conformance | Pass rate of server implementation against the conformance test suite |
| Client Conformance | Pass rate of client implementation against the conformance test suite |
| Labels | Whether SEP-1730 label taxonomy is set up (supports GitHub native issue types) |
| Triage | How quickly issues get labeled after creation |
| P0 Resolution | Whether critical bugs are resolved within SLA |
| Stable Release | Whether a stable release >= 1.0.0 exists |
| Policy Signals | Presence of CHANGELOG, SECURITY, CONTRIBUTING, dependabot, ROADMAP |
| Spec Tracking | Gap between latest spec release and SDK release |

### Example Output

```
Tier Assessment: Tier 2

Repo: modelcontextprotocol/typescript-sdk
Timestamp: 2026-02-10T12:00:00Z

Check Results:

✓ Server Conformance 45/45 (100%)
✓ Client Conformance 4/4 (100%)
✗ Labels 9/12 required labels
Missing: needs confirmation, needs repro, ready for work
✓ Triage 92% within 2BD (150 issues, median 8h)
✓ P0 Resolution 0 open, 3/3 closed within 7d
✓ Stable Release 2.3.1
~ Policy Signals ✓ CHANGELOG.md, ✗ SECURITY.md, ✓ CONTRIBUTING.md, ✓ .github/dependabot.yml, ✗ ROADMAP.md
✓ Spec Tracking 2d gap
```

Use `--output json` to get machine-readable results, or `--output markdown` for a report you can paste into an issue.

## Full AI-Assisted Assessment

The CLI produces a deterministic scorecard, but some SEP-1730 requirements need judgment: documentation quality, dependency policy, roadmap substance. An AI agent can evaluate these by reading the repo.

### Claude Code

The skill lives in `.claude/skills/` in this repo, so if you open [Claude Code](https://docs.anthropic.com/en/docs/claude-code) in the conformance repo it's already available.

1. Make sure `gh auth login` is done (the skill checks this upfront)
2. Start the SDK's everything server in a separate terminal
3. Run the skill:

```
/mcp-sdk-tier-audit <local-sdk-path> <conformance-server-url> [client-cmd]
```

Pass the client command as the third argument to include client conformance testing. If omitted, client conformance is skipped and noted as a gap in the report.

**TypeScript SDK example:**

```bash
# Terminal 1: start the everything server (build first: npm run build)
cd ~/src/mcp/typescript-sdk && npm run test:conformance:server:run

# Terminal 2: run the audit (from the conformance repo)
/mcp-sdk-tier-audit ~/src/mcp/typescript-sdk http://localhost:3000/mcp "npx tsx ~/src/mcp/typescript-sdk/test/conformance/src/everythingClient.ts"
```

**Python SDK example:**

```bash
# Terminal 1: install and start the everything server
cd ~/src/mcp/python-sdk && uv sync --frozen --all-extras --package mcp-everything-server
uv run mcp-everything-server --port 3001

# Terminal 2: run the audit (from the conformance repo)
/mcp-sdk-tier-audit ~/src/mcp/python-sdk http://localhost:3001/mcp "uv run python ~/src/mcp/python-sdk/.github/actions/conformance/client.py"
```

**Go SDK example:**

```bash
# Terminal 1: build and start the everything server
cd ~/src/mcp/go-sdk && go build -o /tmp/go-conformance-server ./conformance/everything-server
go build -o /tmp/go-conformance-client ./conformance/everything-client
/tmp/go-conformance-server -http="localhost:3002"

# Terminal 2: run the audit (from the conformance repo)
/mcp-sdk-tier-audit ~/src/mcp/go-sdk http://localhost:3002 "/tmp/go-conformance-client"
```

**C# SDK example:**

```bash
# Terminal 1: start the everything server (requires .NET SDK)
cd ~/src/mcp/csharp-sdk
dotnet run --project tests/ModelContextProtocol.ConformanceServer --framework net9.0 -- --urls http://localhost:3003

# Terminal 2: run the audit (from the conformance repo)
/mcp-sdk-tier-audit ~/src/mcp/csharp-sdk http://localhost:3003 "dotnet run --project ~/src/mcp/csharp-sdk/tests/ModelContextProtocol.ConformanceClient"
```

The skill derives `owner/repo` from git remote, runs the CLI, launches parallel evaluations for docs and policy, and writes detailed reports to `results/`.

### Any Other AI Coding Agent

If you use a different agent (Codex, Cursor, Aider, OpenCode, etc.), give it these instructions:

1. **Run the CLI** to get the deterministic scorecard:

```bash
node dist/index.js tier-check --repo <repo> --conformance-server-url <url> --output json
```

2. **Evaluate documentation coverage** — check whether MCP features (tools, resources, prompts, sampling, transports, etc.) are documented with examples. See [`references/docs-coverage-prompt.md`](references/docs-coverage-prompt.md) for the full checklist.

3. **Evaluate policies** — check for dependency update policy, roadmap, and versioning/breaking-change policy. See [`references/policy-evaluation-prompt.md`](references/policy-evaluation-prompt.md) for criteria.

4. **Apply tier logic** — combine scorecard + evaluations against the thresholds in [`references/tier-requirements.md`](references/tier-requirements.md).

5. **Generate report** — use [`references/report-template.md`](references/report-template.md) for the output format.

### Manual Review

Run the CLI for the scorecard, then review docs and policies yourself using the tier requirements as a checklist:

| Requirement | Tier 1 | Tier 2 |
| ------------------ | ------------------------------ | ------------------------ |
| Server Conformance | 100% pass | >= 80% pass |
| Client Conformance | 100% pass | >= 80% pass |
| Issue triage | Within 2 business days | Within 1 month |
| P0 resolution | Within 7 days | Within 2 weeks |
| Stable release | >= 1.0.0 with clear versioning | At least one >= 1.0.0 |
| Documentation | All features with examples | Core features documented |
| Dependency policy | Published | Published |
| Roadmap | Published with spec tracking | Plan toward Tier 1 |

## Running Conformance Tests

To include conformance test results, start the SDK's everything server first, then pass the URL to the CLI. To also run client conformance tests, pass `--client-cmd` with the command to launch the SDK's conformance client.

**TypeScript SDK**:

```bash
# Terminal 1: start the server (SDK must be built first)
cd ~/src/mcp/typescript-sdk && npm run build
npm run test:conformance:server:run # starts on port 3000

# Terminal 2: run tier-check (server + client conformance)
npm run --silent tier-check -- \
--repo modelcontextprotocol/typescript-sdk \
--conformance-server-url http://localhost:3000/mcp \
--client-cmd 'npx tsx ~/src/mcp/typescript-sdk/test/conformance/src/everythingClient.ts'
```

**Python SDK**:

```bash
# Terminal 1: install and start the server
cd ~/src/mcp/python-sdk
uv sync --frozen --all-extras --package mcp-everything-server
uv run mcp-everything-server --port 3001 # specify port to avoid conflicts

# Terminal 2: run tier-check (server + client conformance)
npm run --silent tier-check -- \
--repo modelcontextprotocol/python-sdk \
--conformance-server-url http://localhost:3001/mcp \
--client-cmd 'uv run python ~/src/mcp/python-sdk/.github/actions/conformance/client.py'
```

**Go SDK**:

```bash
# Terminal 1: build and start the server
cd ~/src/mcp/go-sdk
go build -o /tmp/go-conformance-server ./conformance/everything-server
go build -o /tmp/go-conformance-client ./conformance/everything-client
/tmp/go-conformance-server -http="localhost:3002"

# Terminal 2: run tier-check (server + client conformance)
npm run --silent tier-check -- \
--repo modelcontextprotocol/go-sdk \
--conformance-server-url http://localhost:3002 \
--client-cmd '/tmp/go-conformance-client'
```

**C# SDK**:

```bash
# Terminal 1: start the server (requires .NET SDK)
cd ~/src/mcp/csharp-sdk
dotnet run --project tests/ModelContextProtocol.ConformanceServer --framework net9.0 -- --urls http://localhost:3003

# Terminal 2: run tier-check (server + client conformance)
npm run --silent tier-check -- \
--repo modelcontextprotocol/csharp-sdk \
--conformance-server-url http://localhost:3003 \
--client-cmd 'dotnet run --project ~/src/mcp/csharp-sdk/tests/ModelContextProtocol.ConformanceClient'
```

**Other SDKs:** Your SDK needs an "everything server" — an HTTP server implementing the [Streamable HTTP transport](https://modelcontextprotocol.io/specification/draft/basic/transports.md) with all MCP features (tools, resources, prompts, etc.). See the implementations above as reference.

Start your everything server, then pass `--conformance-server-url`. Pass `--client-cmd` if your SDK has a conformance client. If neither exists yet, use `--skip-conformance` — the scorecard will note this as a gap.

## Reference Files

These files in [`references/`](references/) contain the detailed criteria and prompts:

| File | Purpose |
| ----------------------------- | ------------------------------------------------------- |
| `tier-requirements.md` | Full SEP-1730 requirements with exact thresholds |
| `docs-coverage-prompt.md` | Feature checklist for documentation evaluation |
| `policy-evaluation-prompt.md` | Criteria for dependency, roadmap, and versioning policy |
| `report-template.md` | Output format for the full audit report |
Loading