
feat: add score command #2648

Open
adamaltman wants to merge 19 commits into main from aa/api-score

Conversation

@adamaltman
Member

@adamaltman adamaltman commented Mar 12, 2026

What/Why/How?

What: Adds a new score command to Redocly CLI that analyzes OpenAPI 3.x descriptions and produces two composite scores: Integration Simplicity (0-100: how easy the API is to integrate) and Agent Readiness (0-100: how usable the API is by AI agents and LLM tooling).

Why: API producers currently lack a quick, deterministic way to assess how developer-friendly or AI-agent-friendly their API descriptions are. The existing stats command counts structural elements but doesn't evaluate quality signals like documentation coverage, example presence, schema complexity, or error response structure. This command fills that gap with actionable, explainable scores.

How: The implementation follows the same pattern as the stats command (bundle + analyze), with a clean separation between metric collection and score calculation:

  • Metric collection (collectors/): Walks the bundled document, resolving internal $refs, to gather per-operation raw metrics (parameter counts, schema depth, polymorphism, description/constraint/example coverage, structured error responses, workflow dependency depth via shared schema refs).
  • Scoring (scoring.ts): Pure functions that normalize raw metrics into subscores and compute weighted composite scores. Thresholds and weights are configurable constants. anyOf is penalized more heavily than oneOf/allOf; discriminator presence improves the agent readiness polymorphism clarity subscore.
  • Hotspots (hotspots.ts): Identifies the operations with the most issues, sorted by number of reasons, with human-readable explanations.
  • Output: --format=stylish (default, with color bar charts) and --format=json (machine-readable for CI/dashboards).
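As a rough illustration of the scoring step, threshold normalization plus weighted aggregation could look like the sketch below. The subscore names, weights, and thresholds are illustrative placeholders, not the actual constants from scoring.ts.

```typescript
// Hypothetical sketch of threshold normalization + weighted composite scoring.
// All names and numbers here are illustrative, not the real scoring.ts constants.
type Subscores = Record<string, number>; // each subscore in 0..1

const WEIGHTS: Subscores = {
  parameterSimplicity: 0.2,
  schemaSimplicity: 0.2,
  documentationQuality: 0.2,
  exampleCoverage: 0.2,
  errorClarity: 0.2,
};

// Linear falloff: full credit at or below `good`, zero credit at or above `bad`.
function normalize(value: number, good: number, bad: number): number {
  if (value <= good) return 1;
  if (value >= bad) return 0;
  return 1 - (value - good) / (bad - good);
}

// Weighted average of subscores, scaled to 0-100 with one decimal place.
function compositeScore(subscores: Subscores): number {
  let weighted = 0;
  let totalWeight = 0;
  for (const [name, weight] of Object.entries(WEIGHTS)) {
    weighted += (subscores[name] ?? 0) * weight;
    totalWeight += weight;
  }
  return Math.round((weighted / totalWeight) * 1000) / 10;
}
```

Because the functions are pure and the constants are plain values, adjusting a threshold or weight in one place changes the whole score deterministically, which is what keeps the output explainable.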

Reference

Related to API governance and developer experience tooling. No existing issue -- this is a new feature.

Testing

  • 43 unit tests across 3 test files covering schema depth calculation, $ref resolution, polymorphism counting (oneOf/anyOf/allOf), constraint detection (including const), example coverage scoring, anyOf penalty multiplier, discriminator impact on agent readiness, deterministic output, and score range validation.
  • Manually tested against three real OpenAPI descriptions (Redocly Cafe: 12 operations, Reunite Main: 299 operations, Rebilly: 606 operations) to verify scores are reasonable and hotspot reasoning is actionable.
  • TypeScript compiles cleanly (tsc --noEmit), all existing tests continue to pass.
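For context on the schema-depth tests above, a max-depth calculation over an already-resolved schema tree can be sketched as below. The node shape is an assumption for illustration; the real collector walks via openapi-core visitors with its own cycle detection.

```typescript
// Minimal sketch of max-schema-depth over an already-$ref-resolved schema tree.
// Assumed shape for illustration; the actual collector uses walkDocument visitors.
interface SchemaNode {
  properties?: Record<string, SchemaNode>;
  items?: SchemaNode;
}

function maxSchemaDepth(schema: SchemaNode, seen = new Set<SchemaNode>()): number {
  if (seen.has(schema)) return 0; // circular-reference guard after $ref resolution
  seen.add(schema);
  let depth = 0;
  for (const child of Object.values(schema.properties ?? {})) {
    depth = Math.max(depth, 1 + maxSchemaDepth(child, seen));
  }
  if (schema.items) {
    depth = Math.max(depth, 1 + maxSchemaDepth(schema.items, seen));
  }
  seen.delete(schema); // the same subschema may legitimately recur on sibling paths
  return depth;
}
```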

Screenshots (optional)

Stylish output for Redocly Cafe:

  Scores

  Integration Simplicity:  85.3/100
  Agent Readiness:         94.4/100

  Integration Simplicity Subscores

  Parameter Simplicity     [█████████████████░░░] 83%
  Schema Simplicity        [████████████░░░░░░░░] 62%
  Documentation Quality    [███████████████████░] 97%
  Constraint Clarity       [███████████████████░] 96%
  Example Coverage         [██████████████████░░] 92%
  Error Clarity            [████████████████████] 100%
  Workflow Clarity         [█████████████████░░░] 83%

  Top 4 Hotspot Operations

  GET /orders (listOrders)
    Integration Simplicity: 69.1  Agent Readiness: 93.9
    - High parameter count (6)
    - Deep schema nesting (depth 5)

  PATCH /orders/{orderId} (updateOrder)
    Integration Simplicity: 77.6  Agent Readiness: 85.3
    - Missing request body examples

Check yourself

  • Code changed? - Tested with Redoc/Realm/Reunite (internal)
  • All new/updated code is covered by tests
  • New package installed? - Tested in different environments (browser/node)
  • Documentation update considered

Security

  • The security impact of the change has been considered
  • Code follows company security practices and guidelines

Note

Medium Risk
Adds a new CLI command with non-trivial OpenAPI walking/scoring logic and extends the core public API (isMappingRef export), which could affect consumers and introduces new analysis paths that may need tuning for edge-case specs.

Overview
Adds a new experimental redocly score command that evaluates OpenAPI 3.x descriptions and outputs an Agent Readiness score (0–100) with normalized subscores, per-operation raw metrics, dependency-depth signals from shared $refs, and a ranked list of hotspot operations with human-readable reasons.

Implements metric collection via OpenAPI document walking (including schema depth/polymorphism/constraints/examples/error-structure and ambiguous parameter detection), scoring/aggregation and discoverability weighting, plus stylish and json formatters and an optional --debug-operation-id schema breakdown.

Updates CLI docs/navigation, adds unit + e2e snapshot coverage for the new command, publishes a changeset, and exports isMappingRef from @redocly/openapi-core (plus a small coverage threshold adjustment).

Reviewed by Cursor Bugbot for commit 6672b0c. Bugbot is set up for automated code reviews on this repo.

@adamaltman adamaltman requested review from a team as code owners March 12, 2026 01:21
@changeset-bot

changeset-bot bot commented Mar 12, 2026

🦋 Changeset detected

Latest commit: 6672b0c

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 3 packages:

  • @redocly/cli: Minor
  • @redocly/openapi-core: Minor
  • @redocly/respect-core: Minor


@github-actions
Contributor

github-actions bot commented Mar 12, 2026

Coverage Report

Status Category Percentage Covered / Total
🔵 Lines 79.74% (🎯 79%) 7107 / 8912
🔵 Statements 79.09% (🎯 78%) 7366 / 9313
🔵 Functions 83.21% (🎯 82%) 1413 / 1698
🔵 Branches 71.09% (🎯 70%) 4772 / 6712
File Coverage
File Stmts Branches Functions Lines Uncovered Lines
Changed Files
packages/cli/src/commands/score/collect-metrics.ts 56.52% 44% 72.72% 60.8% 28, 83-85, 88-90, 93-95, 98, 111-114, 179-196, 137, 212, 237-238, 246-300
packages/cli/src/commands/score/constants.ts 100% 100% 100% 100%
packages/cli/src/commands/score/hotspots.ts 100% 97.67% 100% 100%
packages/cli/src/commands/score/index.ts 96.42% 72.72% 100% 96.42% 125
packages/cli/src/commands/score/scoring.ts 99.01% 92.15% 100% 100% 282
packages/cli/src/commands/score/collectors/dependency-graph.ts 100% 90% 100% 100%
packages/cli/src/commands/score/collectors/document-metrics.ts 92.02% 74.35% 100% 95.62% 123, 129-131, 151-157, 322, 336, 360, 392, 416-426, 457
packages/cli/src/commands/score/formatters/json.ts 100% 100% 100% 100%
packages/cli/src/commands/score/formatters/stylish.ts 58.65% 29.82% 70% 60.24% 10, 78, 79, 107, 146-268
Generated in workflow #9264 for commit 798d612 by the Vitest Coverage Report Action

@github-actions
Contributor

github-actions bot commented Mar 12, 2026

CLI Version Mean Time ± Std Dev (s) Relative Performance (Lower is Faster)
cli-latest 3.540s ± 0.028s ▓ 1.00x
cli-next 3.529s ± 0.019s ▓ 1.00x (Fastest)

- Add "AI" before "agent readiness" in changeset and docs for clarity
- Replace <pre> block with fenced code block in score.md
- Add security scheme coverage to metrics documentation
- Remove resolveIfRef helper, replace with resolveNode that falls back
  to the original node when resolution fails
- Refactor to use walkDocument visitor approach (matching stats command
  pattern) instead of manually iterating the document tree
- Use resolveDocument + normalizeVisitors + walkDocument from
  openapi-core for proper $ref resolution and spec-format extensibility
- Update index.test.ts to mock the new walk infrastructure

Made-with: Cursor
Co-authored-by: Jacek Łękawa <164185257+JLekawa@users.noreply.github.com>
Collaborator

@tatomyr tatomyr left a comment


Left a couple of comments. I haven't fully reviewed the scoring and collectors yet, though, as that takes time.

Collaborator

@tatomyr tatomyr left a comment


I found some dead code. Please check if it's needed and remove if not. Also, I'm not sure what tests to review as many appear to only test that dead code, so let's handle that first.

- Inline collect-metrics.ts test helper into document-metrics.test.ts
- Use parseYaml as yaml directly instead of wrapper function
- Remove default parameter from getStylishOutput in formatter tests
- Use getMajorSpecVersion + exitWithError for spec version check
- Add explicit case 'stylish' before default in format switch
- Remove unsupported 'markdown' from score command format choices
- Add comment explaining depth=-1 initialization
- Clarify anyOf penalty and dependency terminology in docs
- Update non-oas3 rejection test for exitWithError (throws)

Made-with: Cursor
- Add type cast for parseYaml (returns unknown) in document-metrics tests
- Inline collectDocumentMetrics helper into example-coverage tests
- Remove collect-metrics.js import from scoring tests, use direct metrics
- Add missing debugLogs property to accumulator mock in index tests

Made-with: Cursor
Extract the metric-collection pipeline from handleScore into a
standalone collect-metrics.ts module with two exports:
- collectMetrics(): low-level function used by handleScore
- collectDocumentMetrics(): high-level convenience used by tests

This eliminates ~300 lines of duplicated walker setup across three
test files (document-metrics, example-coverage, index) and ensures
tests exercise the same code path as the production command.

Add $ref-keyed memoization to walkSchema so repeated references to
the same component schema return cached stats instead of re-walking.
Stripe API: 37.6s → 11.3s (3.3× faster).

Made-with: Cursor
Add median alongside averages for parameters, schema depth,
polymorphism, and properties in the stylish output. Rename
the misleading "Avg max schema depth" to "Schema depth".

Made-with: Cursor
Rename workflowClarity to dependencyClarity, workflowDepths to
dependencyDepths, and workflow-graph to dependency-graph to align
code naming with the "Dependency Clarity" display label. Also adds
discoverability subscore, recursive composition keyword stripping
for accurate property counting, and updates e2e snapshots.

Made-with: Cursor
- Use isPlainObject from openapi-core for schema cycle detection
  instead of raw typeof checks that match arrays
- Remove formatter unit tests in favor of e2e snapshot coverage
- Clarify makeScores() purpose in hotspot tests

Made-with: Cursor
@adamaltman adamaltman requested a review from tatomyr March 30, 2026 16:33
metrics: makeDocumentMetrics(new Map([['listItems', makeTestMetrics()]])),
debugLogs: [],
});
process.exitCode = undefined;
Collaborator


Seems redundant. At least, I don't see where we use exitCode in the tests.

- Add experimental admonition to score command docs
- Remove --config from usage examples
- Remove redundant mockClear and process.exitCode in tests
- Simplify JSON formatter by using spread instead of manual field mapping
- Use isRef from openapi-core for $ref detection in collect-metrics
- Use isPlainObject and isNotEmptyObject from openapi-core in document-metrics
- Replace != null with Array.isArray for discriminatorRefs check
- Remove unnecessary non-null assertions on discriminatorRefs
- Lower branch coverage threshold to 70% per reviewer guidance

Made-with: Cursor

@cursor cursor bot left a comment


Cursor Bugbot has reviewed your changes and found 4 potential issues.

Autofix Details

Bugbot Autofix prepared fixes for all 4 issues found in the latest run.

  • ✅ Fixed: Discriminator flag lost with oneOf/anyOf polymorphism
    • polyStats now sets hasDiscriminator when the parent schema has a discriminator (propertyName or non-empty mapping), merged with branch stats via OR.
  • ✅ Fixed: Structured error count double-incremented per media type
    • Added errorResponseStructuredCounted so structuredErrorResponseCount increases at most once per error response (description-only in Response.enter, or the first MediaType.enter that counts).
  • ✅ Fixed: Independent max of correlated metrics across media types
    • totalSchemaProperties, schemaPropertiesWithDescription, and constraintCount are now summed across media types instead of maxed independently.
  • ✅ Fixed: Document parsed object mutated during metric collection
    • collectMetrics now uses try/finally so restoreCompositionKeywords always runs after stripCompositionKeywords, even when inner work throws.

Preview (b6568647da)
diff --git a/packages/cli/src/commands/score/collect-metrics.ts b/packages/cli/src/commands/score/collect-metrics.ts
--- a/packages/cli/src/commands/score/collect-metrics.ts
+++ b/packages/cli/src/commands/score/collect-metrics.ts
@@ -122,6 +122,30 @@
 }: CollectMetricsOptions): CollectMetricsResult {
   const removedComposition = stripCompositionKeywords(document.parsed);
 
+  try {
+    return collectMetricsInner({
+      document,
+      types,
+      resolvedRefMap,
+      ctx,
+      debugOperationId,
+      removedComposition,
+    });
+  } finally {
+    restoreCompositionKeywords(removedComposition);
+  }
+}
+
+function collectMetricsInner({
+  document,
+  types,
+  resolvedRefMap,
+  ctx,
+  debugOperationId,
+  removedComposition,
+}: CollectMetricsOptions & {
+  removedComposition: Map<object, StrippedComposition>;
+}): CollectMetricsResult {
   const schemaWalkState = createSchemaWalkState();
   const schemaVisitor = createSchemaMetricVisitor(schemaWalkState);
   const normalizedSchemaVisitors = normalizeVisitors(
@@ -236,6 +260,10 @@
         : null;
     const hasDiscriminatorBranches =
       Array.isArray(discriminatorRefs) && discriminatorRefs.length > 0;
+    const hasParentDiscriminator = !!(
+      disc?.propertyName ||
+      (isPlainObject(disc?.mapping) && Object.keys(disc.mapping).length > 0)
+    );
 
     let result: SchemaStats;
 
@@ -258,6 +286,7 @@
           ...maxBranch,
           polymorphismCount: maxBranch.polymorphismCount + polyBranches.length,
           anyOfCount: maxBranch.anyOfCount + (polyKeyword === 'anyOf' ? polyBranches.length : 0),
+          hasDiscriminator: maxBranch.hasDiscriminator || hasParentDiscriminator,
         };
       }
 
@@ -323,8 +352,6 @@
     ctx,
   });
 
-  restoreCompositionKeywords(removedComposition);
-
   return {
     metrics: getDocumentMetrics(accumulator),
     debugLogs: accumulator.debugLogs,

diff --git a/packages/cli/src/commands/score/collectors/document-metrics.ts b/packages/cli/src/commands/score/collectors/document-metrics.ts
--- a/packages/cli/src/commands/score/collectors/document-metrics.ts
+++ b/packages/cli/src/commands/score/collectors/document-metrics.ts
@@ -212,6 +212,8 @@
   inRequestBody: boolean;
   inResponse: boolean;
   currentResponseCode: string;
+  /** True once structured error counting ran for the current error response (Response or first MediaType). */
+  errorResponseStructuredCounted: boolean;
 
   refsUsed: Set<string>;
 }
@@ -278,6 +280,7 @@
     inRequestBody: false,
     inResponse: false,
     currentResponseCode: '',
+    errorResponseStructuredCounted: false,
 
     refsUsed: new Set(),
   };
@@ -337,8 +340,10 @@
 
         if (isErrorCode(code)) {
           current.totalErrorResponses++;
+          current.errorResponseStructuredCounted = false;
           if (!response.content && response.description) {
             current.structuredErrorResponseCount++;
+            current.errorResponseStructuredCounted = true;
           }
         }
       },
@@ -359,8 +364,13 @@
           if (current.inResponse) current.responseExamplePresent = true;
         }
 
-        if (current.inResponse && isErrorCode(current.currentResponseCode)) {
+        if (
+          current.inResponse &&
+          isErrorCode(current.currentResponseCode) &&
+          !current.errorResponseStructuredCounted
+        ) {
           current.structuredErrorResponseCount++;
+          current.errorResponseStructuredCounted = true;
         }
 
         if (mediaType.schema) {
@@ -372,15 +382,9 @@
           const stats = accumulator.walkSchema(mediaType.schema, isDebugTarget);
 
           current.propertyCount = Math.max(current.propertyCount, stats.propertyCount);
-          current.totalSchemaProperties = Math.max(
-            current.totalSchemaProperties,
-            stats.totalSchemaProperties
-          );
-          current.schemaPropertiesWithDescription = Math.max(
-            current.schemaPropertiesWithDescription,
-            stats.schemaPropertiesWithDescription
-          );
-          current.constraintCount = Math.max(current.constraintCount, stats.constraintCount);
+          current.totalSchemaProperties += stats.totalSchemaProperties;
+          current.schemaPropertiesWithDescription += stats.schemaPropertiesWithDescription;
+          current.constraintCount += stats.constraintCount;
           current.polymorphismCount = Math.max(current.polymorphismCount, stats.polymorphismCount);
           current.anyOfCount = Math.max(current.anyOfCount, stats.anyOfCount);
           if (stats.hasDiscriminator) current.hasDiscriminator = true;


- Fix duplicate import from @redocly/openapi-core (CI lint error)
- Propagate hasDiscriminator flag for oneOf/anyOf + discriminator schemas
- Prevent structured error response double-counting per media type
- Keep totalSchemaProperties and schemaPropertiesWithDescription paired
  from the same schema to avoid misleading documentation quality ratios
- Wrap composition stripping in try/finally to guarantee restoration
- Export isMappingRef from openapi-core and use in collect-metrics
- Rename bfsMaxDepth to computeLongestBfsPath for clarity

Made-with: Cursor
- Add allOf member count to polymorphismCount for parity with
  oneOf/anyOf branch counting
- Handle RFC 6901 JSON Pointer escaping (~0 → ~ and ~1 → /) in
  resolveJsonPointer
- Remove unused scores parameter from getHotspotReasons

Made-with: Cursor
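The RFC 6901 handling added here boils down to unescaping each pointer segment before lookup; the sketch below mirrors the snippet discussed in review. Replacement order matters: ~1 must be decoded before ~0, otherwise "~01" would wrongly decode to "/".

```typescript
// Internal $ref resolution with RFC 6901 unescaping (~1 -> "/", ~0 -> "~").
// ~1 is replaced before ~0 on purpose: the reverse order turns "~01" into "/".
function resolveJsonPointer(root: unknown, ref: string): unknown {
  if (!ref.startsWith('#/')) return undefined; // only local document refs
  let node: any = root;
  for (const segment of ref.slice(2).split('/')) {
    const decoded = decodeURIComponent(segment).replace(/~1/g, '/').replace(/~0/g, '~');
    if (node === null || typeof node !== 'object') return undefined;
    node = node[decoded];
  }
  return node;
}
```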
Collaborator

@tatomyr tatomyr left a comment


These metrics look like a duplication:

[image omitted]

Is it intended?

Comment on lines +8 to +10
async function collect(doc: Record<string, unknown>) {
return (await collectDocumentMetrics(doc)).metrics;
}
Collaborator


This wrapper seems needless and doesn't add to the tests clarity.

}

export function resetSchemaWalkState(s: SchemaWalkState): void {
s.depth = -1; // starts at -1 because the root Schema.enter increments to 0
Collaborator


It seems that resetting the state is the same as createSchemaWalkState. Could it be reused to avoid creating several source of truth?

const schemaWalkState = createSchemaWalkState();
const schemaVisitor = createSchemaMetricVisitor(schemaWalkState);
const normalizedSchemaVisitors = normalizeVisitors(
[{ severity: 'warn', ruleId: 'score-schema', visitor: schemaVisitor as any }],
Collaborator


What is the type cast needed for? Is the visitor type wrong?

return node;
}

interface SchemaStats {
Collaborator


This looks similar to SchemaStats in document-metrics.ts but with a wrong type import. Are they the same?

if (!ref.startsWith('#/')) return undefined;
let node = root;
for (const segment of ref.slice(2).split('/')) {
const decoded = decodeURIComponent(segment).replace(/~1/g, '/').replace(/~0/g, '~');
Collaborator


It's probably better to reuse unescapePointerFragment from ref-utils.ts.

return merged;
}

function hasExample(mediaType: { example?: unknown; examples?: Record<string, unknown> }): boolean {
Collaborator


Suggested change
function hasExample(mediaType: { example?: unknown; examples?: Record<string, unknown> }): boolean {
function hasExample(mediaType: Oas3MediaType): boolean {

Comment on lines +356 to +357
mediaType: { schema?: any; example?: unknown; examples?: Record<string, unknown> },
_ctx: UserContext
Collaborator


Suggested change
mediaType: { schema?: any; example?: unknown; examples?: Record<string, unknown> },
_ctx: UserContext
mediaType

Collaborator


Looks like you're writing more code than needed in places where you could get away with far less 😄. It works correctly, but the codebase gets bloated and harder to maintain.

Comment on lines +146 to +155
maxDepth: schemaWalkState.maxDepth,
polymorphismCount: schemaWalkState.polymorphismCount,
anyOfCount: schemaWalkState.anyOfCount,
hasDiscriminator: schemaWalkState.hasDiscriminator,
propertyCount: schemaWalkState.propertyCount,
totalSchemaProperties: schemaWalkState.totalSchemaProperties,
schemaPropertiesWithDescription: schemaWalkState.schemaPropertiesWithDescription,
constraintCount: schemaWalkState.constraintCount,
hasPropertyExamples: schemaWalkState.hasPropertyExamples,
writableTopLevelFields: schemaWalkState.writableTopLevelFields,
Collaborator


Suggested change
maxDepth: schemaWalkState.maxDepth,
polymorphismCount: schemaWalkState.polymorphismCount,
anyOfCount: schemaWalkState.anyOfCount,
hasDiscriminator: schemaWalkState.hasDiscriminator,
propertyCount: schemaWalkState.propertyCount,
totalSchemaProperties: schemaWalkState.totalSchemaProperties,
schemaPropertiesWithDescription: schemaWalkState.schemaPropertiesWithDescription,
constraintCount: schemaWalkState.constraintCount,
hasPropertyExamples: schemaWalkState.hasPropertyExamples,
writableTopLevelFields: schemaWalkState.writableTopLevelFields,
...schemaWalkState

Could it be simplified like so?

};
}

function buildOperationMetrics(ctx: CurrentOperationContext): OperationMetrics {
Collaborator


What is the difference between those two types? It looks like they are either the same or could be reused, so this function could be avoided or at least shortened.

const seen = new WeakSet<object>();

function walk(node: any): void {
if (!node || typeof node !== 'object' || seen.has(node)) return;
Collaborator


Suggested change
if (!node || typeof node !== 'object' || seen.has(node)) return;
if (!isPlainObject(node) || seen.has(node)) return;

Was this the intention? Or you also want to traverse arrays?

Another question--could nodes be $refs? If yes, you have to resolve them. On the other hand, you already have walkSchemaRaw function which walks through schemas, maybe you can reuse it, but with a different visitor?

@adamaltman
Member Author

Yes, I think there is too much overlap between the two scores so I'm going to consolidate it into one. It will make docs easier, result understanding easier, etc... Addressing other comments too.

… single score

Merge the two overlapping composite scores into a single "Agent Readiness"
score with all 9 subscores. This eliminates duplication in weights, scoring
functions, and output. Also addresses code review feedback: use
unescapePointerFragment, simplify resetSchemaWalkState via Object.assign,
consolidate SchemaStats into types.ts, remove collect() test wrapper, simplify
hasExample return, and use spread for walkSchemaRaw return.

Made-with: Cursor

@cursor cursor bot left a comment


Cursor Bugbot has reviewed your changes and found 3 potential issues.




walk(parsed);
return removed;
}


Nested oneOf/anyOf silently lost by aggressive stripping

Medium Severity

stripCompositionKeywords removes oneOf/anyOf from every schema in the document tree, but walkSchema only restores and handles composition for schemas it directly processes (top-level request/response schemas and their direct composition branches). Schemas nested within properties or items that originally had oneOf/anyOf are walked by walkSchemaRaw, which sees the stripped version — so their branches, property counts, depth, and polymorphism are silently invisible. This undercounts polymorphismCount, anyOfCount, propertyCount, and maxDepth for any API using nested polymorphism, producing artificially optimistic scores.

Additional Locations (1)


maxParamsGood: 5,
maxDepthGood: 4,
maxPolymorphismGood: 2,
maxPropertiesGood: 20,


Threshold maxPropertiesGood defined but never used in scoring

Low Severity

maxPropertiesGood is declared in ScoringThresholds, set to 20 in DEFAULT_SCORING_CONSTANTS, but never referenced in any scoring computation in scoring.ts or elsewhere. This is dead code that may confuse future maintainers into thinking property count is factored into the score when it isn't.

Additional Locations (1)


const sorted = [...values].sort((a, b) => a - b);
const mid = Math.floor(sorted.length / 2);
return sorted.length % 2 !== 0 ? sorted[mid] : (sorted[mid - 1] + sorted[mid]) / 2;
}


Duplicate median utility function across two files

Low Severity

An identical median function is defined as a private function in both scoring.ts and formatters/stylish.ts. This duplicated logic increases the risk of inconsistent bug fixes if the implementation ever needs to change.

Additional Locations (1)

