Skip to content

feat: design outside voices — cross-model design critique (v0.11.3.0)#347

Merged
garrytan merged 6 commits intomainfrom
garrytan/codex-design-voice
Mar 23, 2026
Merged

feat: design outside voices — cross-model design critique (v0.11.3.0)#347
garrytan merged 6 commits intomainfrom
garrytan/codex-design-voice

Conversation

@garrytan
Copy link
Owner

Summary

  • Parallel Codex + Claude subagent design review in /plan-design-review, /design-review, /design-consultation — cross-model litmus scorecard with 7 yes/no checks + 7 hard rejection criteria
  • OpenAI "Designing Delightful Frontends" framework integrated: classifier (landing page vs app UI vs hybrid), hard rules per type, shared AI slop blacklist
  • Codex design voice in DESIGN_REVIEW_LITE — lightweight design check in every /ship and /review when frontend files change
  • Outside voices in /office-hours — design perspectives on brainstormed approaches after wireframe sketches
  • DRY refactor — AI slop blacklist extracted into shared AI_SLOP_BLACKLIST constant, referenced by both generateDesignMethodology() and generateDesignHardRules()
  • 18 new resolver tests covering host guards, skillName branching, classifier step, rule sets

Pre-Landing Review

No issues found — template/infrastructure change only.

Test Coverage

All 549 tests pass. 18 new test cases added for the new resolvers.

TODOS

2 new deferred items added:

  • Outside voices in /plan-eng-review (P3, depends on design beachhead learnings)
  • Outside voices in /qa visual regression detection (P3, depends on /qa design awareness)

Test plan

  • bun run gen:skill-docs regenerates without errors
  • bun test — all gen-skill-docs + skill validation + touchfile tests pass
  • Generated SKILL.md files contain litmus scorecard, hard rules, classifier at correct locations
  • --host codex produces empty outside voices steps (no self-invocation)

🤖 Generated with Claude Code

garrytan and others added 6 commits March 22, 2026 19:18
Add generateDesignOutsideVoices() — parallel Codex + Claude subagent
dispatch for cross-model design critique with litmus scorecard synthesis.
Branches per skillName (plan-design-review, design-review, design-consultation)
with task-specific reasoning effort (high for analytical, medium for creative).

Add generateDesignHardRules() — OpenAI Frontend Skill hard rules + gstack
AI slop blacklist unified into one shared block with classifier step
(landing page vs app UI vs hybrid).

Extract AI_SLOP_BLACKLIST constant from inline prose in generateDesignMethodology()
for DRY. Extend generateDesignReviewLite() with lightweight Codex block.
Extend generateDesignSketch() with outside voices opt-in after wireframe.

Source: OpenAI "Designing Delightful Frontends with GPT-5.4" (Mar 2026)

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
…mplates

Insert {{DESIGN_OUTSIDE_VOICES}} in plan-design-review (between Step 0D
and Pass 1), design-review (between Phase 6 and Phase 7), and
design-consultation (between Phase 2 and Phase 3).

Insert {{DESIGN_HARD_RULES}} in plan-design-review Pass 4 and design-review
Phase 3 checklist.

DESIGN_REVIEW_LITE in /ship and /review now includes a Codex design voice
block with litmus checks.

DESIGN_SKETCH in /office-hours now includes outside voices opt-in after
wireframe approval.

Regenerated all SKILL.md files (both Claude and Codex hosts).

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Add 18 test cases across 4 new describe blocks:
- DESIGN_OUTSIDE_VOICES: host guard, skillName branching, reasoning effort
- DESIGN_HARD_RULES: classifier, 3 rule sets, slop blacklist, OpenAI criteria
- DESIGN_SKETCH extended: outside voices step, original wireframe preserved
- DESIGN_REVIEW_LITE extended: Codex block, codex host exclusion

Update touchfiles: add scripts/gen-skill-docs.ts to design skill E2E
test dependencies for accurate diff-based test selection.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Design outside voices — parallel Codex + Claude subagent for cross-model
design critique with litmus scorecard synthesis. OpenAI hard rules + gstack
slop blacklist unified. Classifier for landing page vs app UI.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
….2.0)

.agents/ is gitignored since v0.11.2.0 — tests that read Codex-host
SKILL.md files now generate them on demand via `bun run gen-skill-docs.ts
--host codex` before reading. Fixes test failures on fresh clones.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@garrytan garrytan merged commit 4cd4d11 into main Mar 23, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant