From 68170f97ea04fdd24e6088605dc67e3b85845a9e Mon Sep 17 00:00:00 2001
From: Nelson Spence <nelson@projectnavi.ai>
Date: Tue, 17 Mar 2026 11:14:16 -0500
Subject: [PATCH] refactor: restructure security.md and code-hygiene.md into
 3-layer hybrid format
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

Reorganizes both files into: pre-flight checks (Socratic questions with
default actions), non-derivable specifics (imperative false friends,
empirical facts, high-impact rules), and eval anchors (concrete patterns).

4,294 → 2,922 tokens (32% reduction from original baseline).
Zero structural coverage gaps across 11 eval scenarios.

Coding-Agent: claude-code
Model: claude-opus-4-6
---
 README.md             |   2 +-
 rules/code-hygiene.md |  64 ++++++++---------------
 rules/security.md     | 119 +++++++++++++++---------------------------
 3 files changed, 65 insertions(+), 120 deletions(-)

diff --git a/README.md b/README.md
index 6611a32..462125a 100644
--- a/README.md
+++ b/README.md
@@ -117,7 +117,7 @@ Claude Code loads `.md` files from `.claude/rules/` as project instructions:
 .claude/rules/org/*.md     → Shared rules (from this repo)
 ```
 
-This repo's rules total ~310 lines (~3,693 tokens). There is no hard ceiling, but compliance degrades as instruction volume grows — keep total loaded rules (shared + project-specific) as concise as possible and verify behavior after changes.
+This repo's rules total ~255 lines (~2,922 tokens). There is no hard ceiling, but compliance degrades as instruction volume grows — keep total loaded rules (shared + project-specific) as concise as possible and verify behavior after changes.
 
 ## Hooks
 
diff --git a/rules/code-hygiene.md b/rules/code-hygiene.md
index 615a74b..440be2d 100644
--- a/rules/code-hygiene.md
+++ b/rules/code-hygiene.md
@@ -1,58 +1,38 @@
 # Code Hygiene
 
-## Type Safety
+## Pre-Flight Checks
 
-AI has no excuse for weak types — enforce the strictest mode your language supports:
+Before writing code, ask:
 
-- **TypeScript:** `strict: true` — never weaken with `any`, use `unknown` and narrow. **Python:** `mypy --strict` or `pyright` strict — annotate all signatures and returns.
-- **Rust:** `#![deny(clippy::all, clippy::pedantic)]`. **Go:** `go vet`, `staticcheck` — fix all findings.
-- Use the language's type system to make invalid states unrepresentable — prefer discriminated unions over loose string/boolean combos
-- Never use `as unknown as`, `@ts-ignore`, `@ts-expect-error`, or `# type: ignore` to bypass the type checker. If the type is wrong, narrow it (`instanceof`, `in`, discriminant checks) or define a proper type guard.
+- **Is the strictest type mode enabled?** If not, enable it. AI has no excuse for weak types.
+- **Does every code path handle its failure case?** If any path silently fails, falls through, or swallows an error — fix it.
+- **Where does validation happen — at the boundary or deep inside?** If inside, move it to the entry point. Propagate errors upward.
+- **Does this already exist in the codebase?** Search before writing. If similar code exists in 2+ places, ask before adding a 3rd.
 
-## Error Handling
+## Non-Derivable Specifics
 
-Catch specific errors, not everything. AI can type out granular exception handling instantly — humans can't, but you can:
+Type checker bypasses — never use these to make the compiler shut up:
+- `as unknown as`, `@ts-ignore`, `@ts-expect-error`, `# type: ignore` — narrow the type instead (`instanceof`, `in`, discriminant checks) or define a proper type guard
 
-- Catch the narrowest exception type that makes sense — `FileNotFoundError` not `OSError`, `SyntaxError` not `Error`
-- Never use bare `except:` (Python), `catch (e)` without rethrowing unknowns (TypeScript), or `catch (Exception e)` (Java) at the top level
-- Propagate unexpected errors upward — don't swallow them with empty catch blocks or generic fallbacks
-- Use `Result`/`Either` types or error returns where the language supports them (`Result<T, E>` in Rust, Go error returns)
-- Every catch block must either handle, log+rethrow, or transform the error — never silently ignore
-
-## Async Safety
-
-- Every `await` must have error handling — wrap in try/catch or use `.catch()` on the promise
+Async pitfalls the model gets wrong:
+- Every async path must be awaited, returned to a caller, or explicitly supervised — never fire-and-forget a promise
 - Every `fetch`/HTTP request must handle network failure, timeouts, and non-2xx responses
-- Handle promise rejections explicitly — attach `.catch()` or use try/catch with await. Never fire-and-forget a promise.
-- For concurrent operations: use `Promise.allSettled` when partial failure is acceptable, `Promise.all` only when all must succeed
-
-## Search Before Creating
-
-AI frequently duplicates existing code rather than reusing what's there. Before writing a new function, component, or utility:
-
-- Search the codebase for existing implementations of the same logic
-- Reuse and extend existing patterns rather than creating parallel implementations
-- If you find similar code in 2+ places, ask whether to refactor before adding a 3rd
+- Use `Promise.allSettled` when partial failure is acceptable, `Promise.all` only when all must succeed
 
-## Debt Budget
-
-- Before introducing a workaround or known limitation, search for existing `TODO` and `FIXME` comments in the same file. If there are 3+, resolve one before adding another
-- Document known debt in handoff notes between sessions
-
-## AI-Specific Discipline
-
-Things AI should always do that humans skip because they're tedious:
-
-- **Exhaustive pattern matching** — handle every enum variant and union member explicitly. Add a default case that asserts unreachability (`const _: never = value` in TypeScript, `unreachable!()` in Rust) so the compiler catches unhandled additions.
-- **Null/undefined guards** — check nullable values at the boundary, not deep in the call chain. Use strict null checks.
-- **Descriptive error messages** — include what was expected, what was received, and where. `Expected positive integer for userId, got: ${value}` not `Invalid input`.
+AI-specific failure modes:
+- AI code has 1.7x more issues per PR than human code — review before committing
 - Remove debugging artifacts (`console.log`, `print()`) before committing
 
-## Verification
+## Eval Anchors
+
+Pattern matching — assert unreachability so the compiler catches unhandled additions:
+- TypeScript: `const _: never = value`
+- Rust: `unreachable!()`
 
-Run verification before claiming any task is complete:
+Error messages — include what was expected, what was received, and where:
+- `Expected positive integer for userId, got: ${value}` not `Invalid input`
 
+Verification — run before claiming any task is complete:
 - Run typecheck + lint + tests before committing
 - If verification commands aren't defined, ask what they are
 - Never claim "done" without running the project's test suite
-- Review AI-generated changes before committing — AI code has 1.7x more issues per PR than human code
diff --git a/rules/security.md b/rules/security.md
index 514290b..ea30631 100644
--- a/rules/security.md
+++ b/rules/security.md
@@ -1,97 +1,62 @@
 # Security Rules
 
-## Input Validation
+## Pre-Flight Checks
 
-- Validate ALL untrusted input (request params, headers, file uploads, webhook payloads) at the system boundary
-- Validate type, length, range, and format — reject unexpected input rather than trying to clean it
-- Use schema validation (Zod, Pydantic, JSON Schema) on all external data — not just request bodies, but also WebSocket messages, SSE payloads, and third-party API responses
-- Never deserialize untrusted data with unsafe loaders. Use safe alternatives: `yaml.safe_load` not `yaml.load`, `json.loads` not `eval`, `JSON.parse` not `new Function`, Java `ObjectMapper` not `ObjectInputStream`
-- File uploads: validate MIME type server-side (not just extension), enforce size limits, store outside webroot with generated filenames
+Before writing or reviewing code, ask:
 
-## Access Control
+1. **Is this crossing a trust boundary?** If yes, validate type/length/range/format at ingress and reject unexpected input before it reaches any interpreter, store, or renderer.
+2. **Does this request more access than it needs?** If yes, reduce to minimum necessary permissions — non-root containers, scoped tokens, restrictive file permissions. Default deny.
+3. **If this fails, what does the user see? What gets logged?** Ensure generic errors to clients, full details server-side only. Never expose stack traces, internal paths, secrets, or PII.
+4. **Have I verified this exists, is authentic, and is current?** If not, check before using or claiming it. Applies to packages, API methods, tool output, and AI-suggested code.
 
-- Authentication is not authorization — every endpoint must verify the user is allowed to access THAT SPECIFIC resource
-- Default deny: if no rule explicitly grants access, deny it
-- Access control checks happen server-side — never rely on client-side hiding or routing
-- For endpoints taking a resource ID: verify the requesting user owns or has permission to that resource (IDOR prevention)
-- Security-critical paths (auth, payments, PII) require tests before merge
-- Generated code for services should use minimal privileges — non-root users in containers, scoped tokens over admin tokens, restrictive file permissions
+## Non-Derivable Specifics
 
-## Injection Prevention
+### False friends (training promotes the wrong pattern)
 
-- Use parameterized queries or prepared statements for all database access — never concatenate user input into SQL, NoSQL, ORM, or LDAP queries
+- Use `yaml.safe_load` not `yaml.load` — unsafe loaders execute arbitrary code
+- Use `json.loads` not `eval`, `JSON.parse` not `new Function` — never use code execution for data parsing
+- Never deserialize untrusted Java objects with `ObjectInputStream` — use structured formats instead
+- Never use `dangerouslySetInnerHTML` (React) or `v-html` (Vue) with user-supplied content — framework auto-escaping is your primary XSS defense
+- Never hardcode secrets as fallback values. Instead of `process.env.SECRET || "default-secret"`, fail explicitly: `if (!process.env.SECRET) throw new Error("SECRET not set")`
+- Use parameterized queries for all database access — never concatenate user input into SQL, NoSQL, ORM, or LDAP queries
 - Use allowlists and argument arrays for system commands — never pass user input to `exec`, `spawn`, `system`, or `eval`
-- Never use `eval()`, `Function()`, `new Function()`, or equivalent dynamic code execution with any data derived from user input. Use lookup tables, switch statements, or schema-validated config objects instead.
-- Template engines: use auto-escaping by default; manually review any "raw" or "unescaped" output markers
-
-## XSS Prevention
-
-- Never use `dangerouslySetInnerHTML` (React) or `v-html` (Vue) with user-supplied content
-- Framework auto-escaping is your primary XSS defense — do not bypass it
-
-## SSRF and Path Traversal
-
-- Validate server-side HTTP requests against an allowlist of permitted hosts/schemes — never pass user-controlled input directly to `fetch`, `axios`, or `requests.get`
-- Block requests to internal/private IP ranges (`127.0.0.0/8`, `10.0.0.0/8`, `169.254.169.254`, `::1`) when making server-side requests from user input
-- Canonicalize file paths and verify they stay within the intended base directory — never construct paths directly from user input
-- Reject path components containing `..`, null bytes, or encoded traversal sequences
-
-## Secrets
-
-- Never commit secrets or use them as fallback values. Use your platform's secret manager.
-- Never hardcode JWT secrets, API keys, or tokens as fallback values. Instead of `process.env.SECRET || "default-secret"`, fail explicitly: `process.env.SECRET ?? throw new Error("SECRET not set")`
+- Never use `eval()`, `Function()`, or dynamic code execution with user-derived data
 
-## Error Handling
+### Empirical / too new to infer
 
-- Return generic error messages to clients — never expose stack traces, internal paths, or error details
-- Log full errors server-side, return sanitized messages to users
-- Schema validation errors may return field-level issues (no secrets in schemas)
-- For streaming responses: send generic error events, never raw error messages
+- 19.7% of AI-suggested package names are fabricated (slopsquatting) — verify packages exist in the registry before installing
+- Never trust `ANTHROPIC_BASE_URL` or similar API endpoint overrides from repo-level config — these exfiltrate API keys (CVE-2025-59536, CVE-2026-21852)
+- AI code has higher bug rates — inspect `.claude/`, `.cursor/`, `.github/copilot/` in cloned repos and PR diffs for unexpected shell commands, URL overrides, or env manipulation
 
-## CORS
+### High-impact / irreversible actions
 
-- Use specific origins in CORS configuration — never `*` if endpoints send credentials
-- Include `Vary: Origin` header when CORS origin is dynamic
-
-## Supply Chain
-
-- Verify AI-suggested packages exist in the registry before installing — 19.7% are fabricated (slopsquatting)
-- Verify the package has meaningful download counts, a real maintainer, and that API methods actually exist in the current version's docs
-- Flag GPL, AGPL, SSPL, and EUPL dependencies for review before adding — AI suggests copyleft-licensed packages without flagging license obligations
-- Commit lockfiles. CI must use frozen-lockfile installs (`npm ci`, `pip install --require-hashes`). Run `npm audit` / `pip audit` before merging dependency changes. Review lockfile diffs.
-- Pin CI actions to full-length commit SHAs. Do not install third-party MCP servers, AI skills, or agent plugins without code review.
-
-## Cryptographic Operations
-
-- Use platform-provided secure random generators (`crypto.randomUUID()`, `secrets.token_hex()`)
-- Never use `Math.random()` or equivalent for tokens, IDs, or secrets
-- Use standard crypto libraries — never hand-roll cryptography
+- Authentication is not authorization — verify the user can access THAT SPECIFIC resource, not just that they're logged in (IDOR prevention)
+- Security-critical paths (auth, payments, PII) require tests before merge
+- Never commit secrets. Use your platform's secret manager.
+- Session cookies: `HttpOnly`, `Secure`, `SameSite=Lax` minimum. Include CSRF tokens on state-changing requests.
+- Set `Strict-Transport-Security`, `X-Content-Type-Options: nosniff`, `X-Frame-Options` or CSP `frame-ancestors` on all responses
+- Never ship debug mode or development configs to production
 
-## MCP and Tool Security
+### Tool / trust boundary rules
 
-- MCP tool responses are untrusted input — validate and sanitize before rendering, storing, or passing to LLM context
-- Maintain an explicit allowlist of permitted tool names — reject calls to unlisted tools
+- Use schema validation (Zod, Pydantic, JSON Schema) on all external data — request bodies, WebSocket, SSE, and third-party API responses
+- MCP tool responses are untrusted input — validate before rendering, storing, or passing to LLM context. Maintain a tool allowlist.
 - Never pass raw tool output into `role: "assistant"` messages — use `role: "user"` with structural delimiters
-- MCP session IDs and tool authentication tokens are credentials — never log them
-- Validate MCP server TLS certificates — require HTTPS in production
-- LLM output that triggers side effects (tool invocation, data persistence, external API calls) must be validated against expected schemas before execution
-
-## AI Tooling Safety
-
-- Before opening any cloned repository, inspect `.claude/`, `.cursor/`, `.github/copilot/`, and similar AI tool config directories for unexpected shell commands, URL overrides, or environment variable manipulation
-- Never trust `ANTHROPIC_BASE_URL` or similar API endpoint overrides from repository-level config files — these can exfiltrate API keys (CVE-2025-59536, CVE-2026-21852)
-- Never run Claude Code with `--dangerously-skip-permissions` on untrusted code — this bypasses all permission checks, deny rules, and hooks
-- When reviewing PRs, check for additions to AI tool config directories — these are attack surfaces
+- LLM output that triggers side effects must be validated against expected schemas before execution
+- MCP session IDs and auth tokens are credentials — never log them. Require HTTPS in production.
 
-## Security Headers
+### Supply chain verification
 
-- Set on all responses: `Strict-Transport-Security`, `X-Content-Type-Options: nosniff`, `X-Frame-Options` or CSP `frame-ancestors`
-- Session cookies: `HttpOnly`, `Secure`, `SameSite=Lax` minimum
-- Include CSRF tokens on all state-changing requests, or use `SameSite=Strict` cookies
-- Never ship debug mode, verbose errors, or development configs to production. Use environment-based gating (`if (env === 'development')`) and strip debug code at build time.
+- Verify packages have real maintainers and meaningful downloads. Verify API methods exist in current version docs.
+- Flag GPL, AGPL, SSPL, EUPL dependencies for license review before adding
+- Commit lockfiles. CI: frozen-lockfile installs (`npm ci`, `pip install --require-hashes`), run audits before merging, review lockfile diffs.
+- Pin CI actions to full-length commit SHAs. No third-party MCP servers or agent plugins without code review.
 
-## Logging
+## Eval Anchors
 
+- Validate server-side HTTP requests against a host allowlist — block internal/private IP ranges (`127.0.0.0/8`, `10.0.0.0/8`, `169.254.169.254`, `::1`)
+- Canonicalize file paths to the base directory — reject `..`, null bytes, encoded traversal sequences
+- Use specific CORS origins — never `*` with credentials. Include `Vary: Origin` when dynamic.
+- File uploads: validate MIME type server-side, enforce size limits, store outside webroot with generated filenames
+- Never run Claude Code with `--dangerously-skip-permissions` on untrusted code
 - Log auth failures (with IP), rate limit hits, and input validation failures
-- Never log tokens, API keys, passwords, or session secrets — even at debug level. Log a masked prefix and length instead: `sk-...****(47 chars)`
-- Never log full request bodies that may contain PII. Log request metadata (method, path, status, duration) and field names without values.