docs: define learning runtime verification matrix#235
Conversation
|
@kapi-agent review Revision explanation for current head What changed:
Why this closes #191:
Verification:
|
kapi-agent review completedThe formal GitHub PR review and required check were submitted separately. This request comment is kept concise to avoid duplicating the full review body. kapi-agent request lifecycle:
|
There was a problem hiding this comment.
kapi-agent review
Verdict: APPROVE
Final approval summary
Review journey
This PR’s original goal is to close #191 by defining the verification contract for the learning parallel runtime before any runtime-readiness/default claims are made. The semantic scope is bounded to documentation plus doc contract tests: one new verification matrix, one README discoverability link, and one test file that guards key matrix sections and issue closeout gates. There was no prior kapi-agent review on this head, so no review/fix ping-pong was required.
What changed
- Added
docs/learning-runtime-verification-matrix.md. - Added
test/learning-runtime-verification-matrix.test.ts. - Updated
README.mdto link the new matrix from the documented layout.
Why this is correct
The matrix covers the expected #191 surfaces: unit, integration, E2E smoke, failure modes, required artifacts/fixtures, and child-issue closeout evidence. It also explicitly keeps the change design-only and blocks runtime-readiness claims until executable coverage or recorded smoke evidence exists for MVP-critical rows.
Evidence
- Verifier gate: PASS.
- Size gate: PASS, 167 changed lines under the 200-line semantic review threshold.
- Revision-explanation status: not required, but provided.
- Ilchul review harness: PASS, LOW_RISK, no blocking findings.
- Verified evidence includes
npm ci && npm run verifypassing. - Inspected files:
README.md,docs/learning-runtime-verification-matrix.md, andtest/learning-runtime-verification-matrix.test.ts.
Remaining risks and approval rationale
Remaining risk is limited to the matrix being a design contract rather than executable runtime validation. That is acceptable for #191 because the PR clearly states this is not a runtime or CI behavior change, and the tests preserve discoverability and minimum contract structure. Approval is justified because all gates pass and no blocking correctness, safety, workflow-contract, or regression issue remains.
Blocking issues
None.
Warnings / risks
docs/learning-runtime-verification-matrix.md: the document intentionally names some “target” test surfaces that may not exist or may not fully cover the stated behavior yet. This is acceptable for a design-only matrix, but future runtime-readiness PRs should not treat these rows as satisfied without executable evidence.
Suggestions
- Consider adding a trailing newline to
docs/learning-runtime-verification-matrix.mdin a follow-up cleanup. - Future implementation PRs should link their new tests/smoke artifacts back to the exact matrix rows they satisfy.
Looks good
- The matrix clearly separates unit, integration, E2E smoke, and failure-mode expectations.
- The child-issue closeout table gives future PRs a concrete evidence contract.
- README discoverability is covered.
- The contract tests are appropriately lightweight for a docs/design slice and guard against accidental removal of the important sections and issue gates.
Verification notes
- Verifier gate status: PASS —
npm ci && npm run verifyexited 0. - Size gate status: PASS — 167 changed lines, below the 200-line review threshold.
- Revision-explanation status: not required; explanation found.
- Ilchul review harness gate: PASS — LOW_RISK, no blocking findings.
- Local/CI evidence in the PR body and gate output includes tests, type/check gates, unused checks, and quality budgets passing.
Engine: pi
Summary
docs/learning-runtime-verification-matrix.mdas the Design: define verification matrix for learning parallel runtime #191 verification contract for the learning parallel runtime.Linked issue
Closes #191
Problem
#167 now has runtime MVP slices through task graph, claim/lease, and worker execution. Before runtime readiness/default claims, #191 needs a concrete verification matrix that maps every runtime invariant to test categories, smoke artifacts, failure-mode expectations, fixtures, and child-issue closeout evidence.
Without that matrix, later runtime PRs can claim readiness from partial unit coverage or narrative worker output instead of executable evidence.
Options considered
Selected approach
Selected option: dedicated design doc plus contract tests.
Why: it satisfies the design-only issue without changing runtime behavior, while still making the matrix enforceable enough that future changes cannot silently drop the verification contract.
Implementation by file/surface
docs/learning-runtime-verification-matrix.mdstate.json,events.jsonl, worker report fixtures, objective/evaluation, reward ledger, integration dry-run, command output.test/learning-runtime-verification-matrix.test.tsREADME.mdWhy this fixes it
The new matrix directly addresses every #191 acceptance criterion:
It also keeps runtime-readiness claims blocked until executable or recorded smoke evidence covers the MVP rows.
QA / Verification
npm ci— pass.npm run check— pass.npm run check:unused— pass.npm test -- test/learning-runtime-verification-matrix.test.ts— pass; package script ran full suite (537tests,526pass,11skipped).npm run quality:budgets— pass with existing non-failingcode_smells=60warning.git diff --cached --check— pass.3 files changed, 167 insertions(+).Anomalies observed
npm ci; initialnpm run checkfailed because@types/nodewas not installed yet.npm test -- test/learning-runtime-verification-matrix.test.tsruns the full suite because the package script includestest/*.test.ts.Risks / Follow-up
kapi-agent review expectations and current-head merge gate
kapi-agentapproval and successfulkapi-agent/review/ formal approval checks before merge.