Structured behavioral specifications for the editorial skills in this project. Each scenario describes an expected behavior in Given/When/Then format, derived from the constraints, done criteria, and acceptance rules in the skill files and AGENTS.md.
These scenarios serve two audiences:
- Human editors — readable specifications of what "correct" looks like for each skill. Use them to evaluate whether an AI-generated output meets editorial standards before accepting it.
- AI agents — machine-readable expectations that can be checked against actual output. When a skill's done criteria say "every procedure includes rollback," these scenarios spell out exactly what that means.
| Directory | Skill | What it covers |
|---|---|---|
security-researcher/ |
security-researcher |
Source grounding, vendor/editorial separation, Veloria verification |
wordpress-runbook-ops/ |
wordpress-runbook-ops |
Procedure schema, destructive command safety, WP-CLI validity, code fences |
wordpress-security-doc-editor/ |
wordpress-security-doc-editor |
Authority hierarchy, terminology, cross-document alignment, benchmark structure |
cross-skill/ |
Multiple skills | Audit workflow, synthesis, Style Guide protection, version consistency |
Every .md file in this directory follows the same structure:
# <Scenario Title>
Skill: <skill-name>
Agent(s): <AGENTS.md agent role(s)>
## <Scenario Name>
**Given** <precondition>
**When** <action or trigger>
**Then** <expected outcome>
### Examples
<concrete examples of pass/fail cases>Rules:
- One file per behavioral concern. Keep scenarios focused.
- Each file may contain multiple related scenarios under the same concern.
- Examples use concrete, realistic content from the WordPress security domain.
- Pass/fail examples make the boundary between acceptable and unacceptable output unambiguous.
Before accepting AI-generated output, check the relevant scenarios:
- Identify which skill produced the output.
- Read the scenario files for that skill.
- Verify each applicable scenario's Then-clause against the actual output.
- Flag any failures as revision requests.
When modifying a SKILL.md or adding new constraints:
- Write the scenario first (the behavioral spec).
- Update the skill to enforce the new behavior.
- Verify existing scenarios still pass.
The scenarios in cross-skill/ define behaviors that span multiple skills and agents. These are particularly useful during cross-document revision rounds (AGENTS.md, section 5) where multiple agents collaborate on a single revision plan.
The test-runs/ directory contains documented results of BDD cycle tests — cases where a skill was used to generate output and then each applicable scenario was checked against the result with pass/fail evidence.
Test runs serve as validation that the scenarios catch real defects and as a record of scenario coverage gaps worth addressing.
| Date | Skill | Procedure | Score |
|---|---|---|---|
| 2026-03-11 | wordpress-runbook-ops |
Domain migration (URL search-replace) | 10/11 |
| File | Role |
|---|---|
AGENTS.md |
Defines agent roles and guardrails. Scenarios operationalize the acceptance criteria from section 6. |
wp-docs-skills/*/SKILL.md |
Defines skill constraints and done criteria. Scenarios expand these into testable expectations. |
docs/current-metrics.md |
Tracks project counts. Update when adding new scenario files. |