docs: restructure agent and contributor documentation (plan 427, PR 1)#454
Open
docs: restructure agent and contributor documentation (plan 427, PR 1)#454
Conversation
Restructure AGENTS.md from ~627 lines to ~55 lines of high-signal architectural invariants. Extract code style into STYLEGUIDE.md and development workflow into DEVELOPMENT.md. Overhaul CONTRIBUTING.md to reflect agent-assisted development as the primary workflow. Move skills and sub-agents from .claude/ to .agents/ as the tool-agnostic home, with symlinks back for Claude Code compatibility. Add architecture/ skeleton with 10 stub files for incremental population. Implements PR 1 of #427. Made-with: Cursor
The new-sdg skill is superseded by skills/data-designer/, which is the proper usage skill for building datasets. Update .agents/README.md to reference the usage skill's actual location. Made-with: Cursor
Add docstring conventions (Google style), Pydantic/dataclass guidance, error handling patterns, and f-string preference to STYLEGUIDE.md. Clarify per-package test targets, flat test style, e2e API key requirement, notebook regeneration commands, and import perf threshold in DEVELOPMENT.md. Point dataset-building agents to the data-designer skill in AGENTS.md and clarify dependency direction arrows. Made-with: Cursor
Made-with: Cursor
Contributor
Greptile SummaryThis PR restructures DataDesigner's developer documentation by splitting a bloated 626-line
|
| Filename | Overview |
|---|---|
| AGENTS.md | Rewritten from ~626 lines to ~56 lines. Accurately describes layering, dependency direction, core concepts, and structural invariants. Correctly references STYLEGUIDE.md and DEVELOPMENT.md for details. No issues found. |
| STYLEGUIDE.md | New comprehensive style guide extracted from old AGENTS.md with added sections on docstrings, Pydantic/dataclass patterns, and error handling. The previously flagged SIM enforcement note was corrected in eb5315b. Active linter rules section accurately reflects pyproject.toml. No issues found. |
| DEVELOPMENT.md | New development guide covering prerequisites, setup, workflow, testing patterns, pre-commit hooks, and import performance thresholds. Content is accurate and well-organized. No issues found. |
| CONTRIBUTING.md | Overhauled to focus on agent-assisted contribution workflow. Previously flagged copyright year inconsistency was corrected in eb5315b. Streamlined significantly from the previous ~236 lines. Correctly links to issue templates and new doc structure. |
| .agents/README.md | New file clearly documents the .agents/ directory structure, symlink targets, and usage scope (development vs. end-user). Accurate and concise. |
| .agents/skills/review-code/SKILL.md | Updated Step 2 to load all three new docs (AGENTS.md, STYLEGUIDE.md, DEVELOPMENT.md) rather than the single old AGENTS.md. The split is logical and accurate. No issues found. |
| .claude/agents | Symlink pointing to ../.agents/agents. Relative target resolves correctly from .claude/ to the repo root and into .agents/agents/. |
| .claude/skills | Symlink pointing to ../.agents/skills. Relative target resolves correctly from .claude/ to the repo root and into .agents/skills/. |
| architecture/overview.md | Correctly marked as a stub with a prominent notice and placeholder sections. Cross-references to sibling architecture docs are accurate. |
| plans/427/agent-first-development-plan.md | Updated delivery strategy to combine Phases 0–2 into this PR. Content is consistent with what was actually delivered. No issues found. |
Flowchart
%%{init: {'theme': 'neutral'}}%%
flowchart TD
CLAUDE["CLAUDE.md\n(@AGENTS.md)"] --> AGENTS["AGENTS.md\nArchitecture · Layering\nCore principles · Invariants"]
AGENTS --> STYLE["STYLEGUIDE.md\nFormatting · Naming · Types\nImports · Docstrings\nPydantic · Error handling"]
AGENTS --> DEV["DEVELOPMENT.md\nSetup · Workflow · Testing\nPre-commit · Import perf"]
AGENTS --> ARCH["architecture/\n10 stubs (Phase 3)"]
CONTRIB["CONTRIBUTING.md\nAgent-assisted workflow\nIssues · PRs · Reviews"] --> DEV
SKILL["review-code SKILL.md\n(Step 2: load all three)"] --> AGENTS
SKILL --> STYLE
SKILL --> DEV
subgraph ".agents/ (canonical)"
AG2["agents/\ndocs-searcher\ngithub-searcher"]
SK2["skills/\ncommit · create-pr\nreview-code · update-pr\nsearch-docs · search-github"]
end
subgraph ".claude/ (symlinks)"
CLA["agents → ../.agents/agents"]
CLS["skills → ../.agents/skills"]
end
CLA -.->|symlink| AG2
CLS -.->|symlink| SK2
Reviews (5): Last reviewed commit: "Merge branch 'main' into nmulepati/docs/..." | Re-trigger Greptile
Add plan document step, self-review with multi-model passes, automated CI review expectations, and comment resolution protocol. Made-with: Cursor
Move architecture doc population from deferred/incremental to PR 2 since the subsystems already exist. Update plan delivery strategy, execution order, and out-of-scope sections accordingly. Made-with: Cursor
…ibuting Replace pd.DataFrame with list[dict[str, str]] in naming example to avoid contradicting lazy-import guidance in the same file. Soften "enforced by SIM" to note SIM rules are not yet enabled in CI. Fix upstream sync instructions for fork-based contributors. Update copyright year in CONTRIBUTING.md from 2025 to 2026 to match STYLEGUIDE.md. Made-with: Cursor
Contributor
Author
|
All findings addressed in eb5315b:
|
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
📋 Summary
Restructure DataDesigner's documentation to clearly separate concerns: a concise architectural guide for agents (
AGENTS.md), a comprehensive code style reference (STYLEGUIDE.md), a contributor-focused development guide (DEVELOPMENT.md), and an updated contribution guide (CONTRIBUTING.md). This is PR 1 of the agent-assisted development plan (plan #427), covering Phases 0–2.🔄 Changes
✨ Added
STYLEGUIDE.md— New comprehensive code style guide extracted from the oldAGENTS.md, expanded with:Args:,Returns:,Raises:,Attributes:)ConfigBasepatterns, validator naming)raise ... from exc, boundary wrapping, canonical error types)DEVELOPMENT.md— New development guide extracted from the oldAGENTS.mdandCONTRIBUTING.md, including:make test-config,make test-engine,make test-interface)make convert-execute-notebooks,make generate-colab-notebooks).agents/README.md— Documents the.agents/directory structure, symlink compatibility, and development-vs-usage scopearchitecture/— 10 stub architecture documents (overview.md,config.md,engine.md,models.md,mcp.md,dataset-builders.md,sampling.md,cli.md,agent-introspection.md,plugins.md) ready for Phase 3 content🔧 Changed
AGENTS.md— Rewritten from ~626 lines to ~56 lines as a focused architectural guide: identity, layering, core concepts, design principles, structural invariants, and development pointersCONTRIBUTING.md— Overhauled to focus on agent-assisted contribution workflow, referencing the new doc structureREADME.md— Added brief mention of agent-assisted development.agents/skills/review-code/SKILL.md— Updated to reference the new three-file doc structure (AGENTS.md,STYLEGUIDE.md,DEVELOPMENT.md)plans/427/agent-first-development-plan.md— Updated delivery strategy to combine Phases 0–2 into PR 1🗑️ Removed
.claude/skills/new-sdg/— Obsolete prototyping skill, superseded byskills/data-designer/🏗️ Restructured
.claude/skills/and.claude/agents/— Canonical files moved to.agents/skills/and.agents/agents/;.claude/directories replaced with symlinks for backward compatibility🔍 Attention Areas
AGENTS.md— This is the primary file agents read on every interaction. Verify the layering description, dependency direction, and structural invariants are accurate..claude/agentsand.claude/skills— These are now symlinks to.agents/. Verify they resolve correctly in your local checkout.STYLEGUIDE.md— New sections on docstrings, Pydantic/dataclass conventions, and error handling should be validated against current codebase patterns.🤖 Generated with AI
PR-1 for #427