docs: restructure agent and contributor documentation (plan 427, PR 1)#454
Merged
docs: restructure agent and contributor documentation (plan 427, PR 1)#454
Conversation
Restructure AGENTS.md from ~627 lines to ~55 lines of high-signal architectural invariants. Extract code style into STYLEGUIDE.md and development workflow into DEVELOPMENT.md. Overhaul CONTRIBUTING.md to reflect agent-assisted development as the primary workflow. Move skills and sub-agents from .claude/ to .agents/ as the tool-agnostic home, with symlinks back for Claude Code compatibility. Add architecture/ skeleton with 10 stub files for incremental population. Implements PR 1 of #427. Made-with: Cursor
The new-sdg skill is superseded by skills/data-designer/, which is the proper usage skill for building datasets. Update .agents/README.md to reference the usage skill's actual location. Made-with: Cursor
Add docstring conventions (Google style), Pydantic/dataclass guidance, error handling patterns, and f-string preference to STYLEGUIDE.md. Clarify per-package test targets, flat test style, e2e API key requirement, notebook regeneration commands, and import perf threshold in DEVELOPMENT.md. Point dataset-building agents to the data-designer skill in AGENTS.md and clarify dependency direction arrows. Made-with: Cursor
Made-with: Cursor
Contributor
Greptile SummaryThis PR restructures DataDesigner's documentation into four focused files — a slim architectural guide ( Key changes:
|
| Filename | Overview |
|---|---|
| AGENTS.md | Refactored from ~626 lines to ~56 lines — now a focused architectural guide covering layering, core concepts, design principles, structural invariants, and pointers to STYLEGUIDE.md / DEVELOPMENT.md. PEP 420 namespace package reference is accurate. |
| STYLEGUIDE.md | New comprehensive style reference extracted and expanded from old AGENTS.md. Adds Google-style docstring conventions, Pydantic/dataclass guidance, error handling patterns, and nested-function avoidance. SIM enforcement wording correctly softened to "not yet enforced by CI" (per prior review fix in eb5315b). |
| DEVELOPMENT.md | New development guide with local setup, workflow, and testing documentation. All referenced Makefile targets (test-config, test-engine, test-interface, test-e2e, convert-execute-notebooks, generate-colab-notebooks, perf-import) confirmed present in the Makefile. CI threshold (3-second average, 1 cold + 4 warm import cycles) is well documented. |
| CONTRIBUTING.md | Overhauled to focus on agent-assisted contribution workflow. Previous copyright year inconsistency (2025 vs 2026) was resolved in eb5315b. Traditional bug report / feature request templates are removed, but this appears intentional per the agent-first plan — Phase 3 will add GitHub issue templates. |
| .agents/skills/review-code/SKILL.md | Updated to reference the new three-file doc structure (AGENTS.md, STYLEGUIDE.md, DEVELOPMENT.md) in the "Load Project Guidelines" step, replacing the monolithic AGENTS.md-only reference. |
| .claude/agents | Now a symlink (mode 120000) pointing to ../.agents/agents. Old directory contents moved to .agents/agents/ first. Symlink target path is correct relative to .claude/. |
| .claude/skills | Now a symlink (mode 120000) pointing to ../.agents/skills. Previous new-sdg skill removed (superseded by skills/data-designer/). Symlink target path is correct relative to .claude/. |
| plans/427/agent-first-development-plan.md | Delivery strategy updated to combine Phases 0–2 into PR 1. New execution order table is well-structured with authoring sequence notes. Architecture stubs scope clarified (content to be populated in PR 2). |
Flowchart
%%{init: {'theme': 'neutral'}}%%
flowchart TD
subgraph Agent["Agent Entry Points"]
A1[".claude/agents (symlink)"]
A2[".claude/skills (symlink)"]
end
subgraph Canonical[".agents/ (Canonical Source)"]
B1[".agents/agents/"]
B2[".agents/skills/"]
end
subgraph DocFiles["Root Documentation Files"]
D1["AGENTS.md\n(~56 lines: arch + invariants)"]
D2["STYLEGUIDE.md\n(style, types, imports, patterns)"]
D3["DEVELOPMENT.md\n(setup, testing, workflow)"]
D4["CONTRIBUTING.md\n(agent-assisted contribution workflow)"]
end
subgraph Arch["architecture/ (10 stubs)"]
E1["overview.md"]
E2["config.md / engine.md / models.md"]
E3["cli.md / mcp.md / plugins.md / ..."]
end
A1 -->|symlink| B1
A2 -->|symlink| B2
B2 --> |review-code SKILL.md loads| D1
B2 --> |review-code SKILL.md loads| D2
B2 --> |review-code SKILL.md loads| D3
D1 --> |"links to"| D2
D1 --> |"links to"| D3
D1 --> |"links to"| Arch
D4 --> |"references"| D1
D4 --> |"references"| D2
D4 --> |"references"| D3
Reviews (6): Last reviewed commit: "Merge branch 'main' into nmulepati/docs/..." | Re-trigger Greptile
Add plan document step, self-review with multi-model passes, automated CI review expectations, and comment resolution protocol. Made-with: Cursor
Move architecture doc population from deferred/incremental to PR 2 since the subsystems already exist. Update plan delivery strategy, execution order, and out-of-scope sections accordingly. Made-with: Cursor
…ibuting Replace pd.DataFrame with list[dict[str, str]] in naming example to avoid contradicting lazy-import guidance in the same file. Soften "enforced by SIM" to note SIM rules are not yet enabled in CI. Fix upstream sync instructions for fork-based contributors. Update copyright year in CONTRIBUTING.md from 2025 to 2026 to match STYLEGUIDE.md. Made-with: Cursor
Contributor
Author
|
All findings addressed in eb5315b:
|
johnnygreco
approved these changes
Mar 25, 2026
andreatgretel
approved these changes
Mar 25, 2026
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
📋 Summary
Restructure DataDesigner's documentation to clearly separate concerns: a concise architectural guide for agents (
AGENTS.md), a comprehensive code style reference (STYLEGUIDE.md), a contributor-focused development guide (DEVELOPMENT.md), and an updated contribution guide (CONTRIBUTING.md). This is PR 1 of the agent-assisted development plan (plan #427), covering Phases 0–2.🔄 Changes
✨ Added
STYLEGUIDE.md— New comprehensive code style guide extracted from the oldAGENTS.md, expanded with:Args:,Returns:,Raises:,Attributes:)ConfigBasepatterns, validator naming)raise ... from exc, boundary wrapping, canonical error types)DEVELOPMENT.md— New development guide extracted from the oldAGENTS.mdandCONTRIBUTING.md, including:make test-config,make test-engine,make test-interface)make convert-execute-notebooks,make generate-colab-notebooks).agents/README.md— Documents the.agents/directory structure, symlink compatibility, and development-vs-usage scopearchitecture/— 10 stub architecture documents (overview.md,config.md,engine.md,models.md,mcp.md,dataset-builders.md,sampling.md,cli.md,agent-introspection.md,plugins.md) ready for Phase 3 content🔧 Changed
AGENTS.md— Rewritten from ~626 lines to ~56 lines as a focused architectural guide: identity, layering, core concepts, design principles, structural invariants, and development pointersCONTRIBUTING.md— Overhauled to focus on agent-assisted contribution workflow, referencing the new doc structureREADME.md— Added brief mention of agent-assisted development.agents/skills/review-code/SKILL.md— Updated to reference the new three-file doc structure (AGENTS.md,STYLEGUIDE.md,DEVELOPMENT.md)plans/427/agent-first-development-plan.md— Updated delivery strategy to combine Phases 0–2 into PR 1🗑️ Removed
.claude/skills/new-sdg/— Obsolete prototyping skill, superseded byskills/data-designer/🏗️ Restructured
.claude/skills/and.claude/agents/— Canonical files moved to.agents/skills/and.agents/agents/;.claude/directories replaced with symlinks for backward compatibility🔍 Attention Areas
AGENTS.md— This is the primary file agents read on every interaction. Verify the layering description, dependency direction, and structural invariants are accurate..claude/agentsand.claude/skills— These are now symlinks to.agents/. Verify they resolve correctly in your local checkout.STYLEGUIDE.md— New sections on docstrings, Pydantic/dataclass conventions, and error handling should be validated against current codebase patterns.🤖 Generated with AI
PR-1 for #427