
Commit 7b9c029

Merge branch 'copilot/remote-lobster' FIXES: 58

2 parents: 70c747b + f252c65

6 files changed: 1802 additions & 786 deletions

content/config/batch_sweeps.yml

Lines changed: 72 additions & 0 deletions
# Batch Simulation Sweep Configuration
# Defines parameter ranges for multi-dimensional simulation sweeps.
# Used by scripts/run_batch_sweeps.py for balance analysis and regression testing.

# Parameter Grid - each combination is tested (Cartesian product)
parameters:
  # AI strategies to test
  strategies:
    - balanced
    - aggressive
    - diplomatic

  # Difficulty presets (maps to content/config/sweeps/difficulty-<preset>/)
  difficulties:
    - normal

  # Random seeds for deterministic reproducibility
  # Can be explicit list or range definition
  seeds:
    - 42
    - 123
    - 456

  # World bundles to test (from content/worlds/)
  worlds:
    - default

  # Tick budgets for simulation length
  tick_budgets:
    - 100

# Parallel execution settings
parallel:
  # Maximum worker processes (null = auto-detect based on CPU count)
  max_workers: null

  # Timeout per individual sweep in seconds
  timeout_per_sweep: 300

# Output configuration
output:
  # Directory for sweep result JSON files
  dir: build/batch_sweeps

  # Include full telemetry in output (increases file size)
  include_telemetry: true

  # Include game state summary in output
  include_summary: true

# Sampling configuration for large parameter spaces
# When enabled, samples from the grid instead of full Cartesian product
sampling:
  # Sampling mode: "full" (all combinations), "random", "latin_hypercube"
  mode: full

  # Number of samples to take (only used when mode != "full")
  sample_count: 100

  # Random seed for sampling reproducibility
  sample_seed: 42

# Metadata included in every sweep output
metadata:
  # Include git commit hash if available
  include_git_commit: true

  # Include timestamp
  include_timestamp: true

  # Include runtime environment info
  include_runtime_info: true
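This configuration is consumed by `scripts/run_batch_sweeps.py` (not shown in this commit view). As a rough illustration of what the parameter grid implies, the sketch below mirrors the `parameters` block as a Python dict and counts the full Cartesian product; the dict literal is a stand-in for parsing the YAML, not the script's actual loading code.

```python
from math import prod

# Mirror of the `parameters` block above (stand-in for YAML parsing).
parameters = {
    "strategies": ["balanced", "aggressive", "diplomatic"],
    "difficulties": ["normal"],
    "seeds": [42, 123, 456],
    "worlds": ["default"],
    "tick_budgets": [100],
}

# The full sweep count is the product of the option counts per axis.
grid_size = prod(len(values) for values in parameters.values())
print(grid_size)  # 3 strategies * 1 difficulty * 3 seeds * 1 world * 1 budget = 9
```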

docs/gengine/ai_tournament_and_balance_analysis.md

Lines changed: 132 additions & 11 deletions
# Section 13: AI Tournament & Balance Analysis

**Last Updated:** 2025-12-04

## Overview

This guide explains how to use the AI tournament, batch sweep, and balance analysis tools introduced in Phases 9 and 11. These utilities enable designers and developers to:

- Run large batches of AI-driven games in parallel
- Compare strategy and difficulty performance
- Identify balance issues and underutilized content
- Automate regression and balance testing in CI

## Running AI Tournaments

The tournament script executes multiple games in parallel, each using a configurable AI strategy (`BALANCED`, `AGGRESSIVE`, `DIPLOMATIC`, `HYBRID`). Telemetry is captured for each game, and results are aggregated into a single JSON file for analysis.

**Example:**

```bash
uv run python scripts/run_ai_tournament.py --games 100 --output build/tournament.json
```

**Key options:**

- `--games`: Number of games to run (default: 100)
- `--output`: Path to save the aggregated results
- `--strategies`: Strategies to test (e.g., `balanced aggressive`)
- `--seeds`: Random seeds for reproducibility
- `--worlds`: World configuration bundles
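The fan-out-and-aggregate shape described above can be sketched with the standard library. This is illustrative only, not the script's actual implementation: `play_game` is a hypothetical stand-in for a full game run, and a thread pool is used here for simplicity where the real script would more likely use worker processes.

```python
import json
from concurrent.futures import ThreadPoolExecutor

def play_game(config):
    """Hypothetical stand-in for a single AI-driven game run."""
    strategy, seed = config
    # A real run would simulate the game here and collect its telemetry.
    return {"strategy": strategy, "seed": seed, "final_stability": 0.5}

def run_tournament(strategies, seeds):
    # Fan out every (strategy, seed) pairing to a worker pool.
    configs = [(s, seed) for s in strategies for seed in seeds]
    with ThreadPoolExecutor(max_workers=4) as pool:
        results = list(pool.map(play_game, configs))
    # Aggregate all per-game records into one JSON document.
    return json.dumps({"games": results}, indent=2)

print(run_tournament(["balanced", "aggressive"], [42, 123]))
```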
## Running Batch Simulation Sweeps

The batch sweep script (Phase 11, M11.1) enables multi-dimensional parameter-space exploration for comprehensive balance analysis. It generates the Cartesian product of the configured parameter values and executes each combination in parallel, allowing you to:

- Stress-test balance across strategies, difficulties, seeds, worlds, and tick budgets
- Sample large parameter spaces efficiently
- Aggregate results for statistical analysis
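The Cartesian-product expansion can be pictured with `itertools.product`. This is a minimal sketch under an invented three-axis grid, not the script's actual code:

```python
from itertools import product

# One illustrative axis set, mirroring the configured parameter grid.
grid = {
    "strategy": ["balanced", "aggressive", "diplomatic"],
    "difficulty": ["normal", "hard"],
    "seed": [42, 123, 456],
}

# Each combination of one value per axis becomes one sweep's parameter set.
axes = list(grid)
sweeps = [dict(zip(axes, combo)) for combo in product(*grid.values())]

print(len(sweeps))  # 3 * 2 * 3 = 18 combinations
print(sweeps[0])    # {'strategy': 'balanced', 'difficulty': 'normal', 'seed': 42}
```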
### Configuration

Batch sweeps are configured via `content/config/batch_sweeps.yml`. Any parameter can be overridden with CLI flags.

```yaml
parameters:
  strategies:
    - balanced
    - aggressive
    - diplomatic
  difficulties:
    - normal
    - hard
  seeds:
    - 42
    - 123
    - 456
  worlds:
    - default
  tick_budgets:
    - 100
    - 200

parallel:
  max_workers: null  # Auto-detect CPU count
  timeout_per_sweep: 300

output:
  dir: build/batch_sweeps
  include_telemetry: true

sampling:
  mode: full  # Options: full, random, latin_hypercube
  sample_count: 100
```

**Tip:** For very large parameter spaces, use `sampling.mode: random` and adjust `sample_count` to control the number of sweeps.
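Random sampling over the grid might look like the following sketch, which draws a seeded subset so a sampled run stays reproducible. The function and its behavior are illustrative assumptions; the script's real sampler may differ.

```python
import random
from itertools import product

def sample_grid(grid, sample_count, sample_seed):
    """Draw a reproducible random subset of the full Cartesian product."""
    axes = list(grid)
    combos = [dict(zip(axes, c)) for c in product(*grid.values())]
    if sample_count >= len(combos):
        return combos  # Grid smaller than the budget: just run everything.
    # A seeded RNG returns the same subset on every run.
    return random.Random(sample_seed).sample(combos, sample_count)

grid = {"strategy": ["balanced", "aggressive"], "seed": [42, 123, 456]}
picked = sample_grid(grid, sample_count=3, sample_seed=42)
print(len(picked))  # 3
```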
### Running Batch Sweeps

**Basic execution with the default configuration:**

```bash
uv run python scripts/run_batch_sweeps.py --output-dir build/sweeps --verbose
```

**Override parameters via CLI:**

```bash
uv run python scripts/run_batch_sweeps.py \
  --strategies balanced aggressive \
  --difficulties normal hard \
  --seeds 42 123 456 \
  --ticks 100 200 \
  --output-dir build/custom_sweeps
```

**Use a custom configuration file:**

```bash
uv run python scripts/run_batch_sweeps.py --config path/to/custom_sweeps.yml
```

### Output Format

Each sweep produces a JSON file containing:

- `parameters`: Full parameter set (strategy, difficulty, seed, world, tick_budget)
- `results`: Game outcome data (final_stability, actions_taken, story_seeds_activated)
- `telemetry`: Metrics and profiling data (environment, faction_legitimacy, economy)
- `metadata`: Timestamp, git commit, runtime info

A summary file, `batch_sweep_summary.json`, aggregates all results, including:

- Strategy-level statistics (average/min/max stability, win rates)
- Difficulty-level statistics
- Total sweep counts and failure rates
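The strategy-level aggregation in the summary can be sketched as below. The field names follow the output description above; the records themselves are invented examples, and this is not the script's actual summarizer.

```python
from collections import defaultdict
from statistics import mean

# Invented sweep records shaped like the per-sweep JSON described above.
sweeps = [
    {"parameters": {"strategy": "balanced"}, "results": {"final_stability": 0.62}},
    {"parameters": {"strategy": "balanced"}, "results": {"final_stability": 0.58}},
    {"parameters": {"strategy": "aggressive"}, "results": {"final_stability": 0.41}},
]

# Group final_stability values by strategy, then reduce to summary stats.
by_strategy = defaultdict(list)
for sweep in sweeps:
    by_strategy[sweep["parameters"]["strategy"]].append(
        sweep["results"]["final_stability"]
    )

summary = {
    strategy: {"avg": mean(vals), "min": min(vals), "max": max(vals)}
    for strategy, vals in by_strategy.items()
}
print(summary["balanced"])  # average of the two balanced runs, plus min/max
```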
110+
111+
### CLI Options
112+
113+
**Common CLI Flags:**
114+
115+
| Flag | Description |
116+
|-------------------|---------------------------------------------|
117+
| `--config, -c` | Path to YAML configuration file |
118+
| `--strategies, -s`| Override strategies to test |
119+
| `--difficulties, -d` | Override difficulty presets |
120+
| `--seeds` | Override random seeds |
121+
| `--worlds, -w` | Override world bundles |
122+
| `--ticks, -t` | Override tick budgets |
123+
| `--workers` | Max parallel workers |
124+
| `--output-dir, -o`| Output directory for results |
125+
| `--json` | Output summary as JSON |
126+
| `--verbose, -v` | Print progress during execution |
127+
| `--no-write` | Skip writing individual sweep files |
## Analyzing Tournament Results

After running a tournament or batch sweep, use the analysis script to generate comparative reports.

**Example:**

```bash
uv run python scripts/analyze_ai_games.py build/tournament.json --report build/analysis.txt
```

**Key option:**

- `--report`: Path to save the analysis output

The report includes:

- Win rate comparison across strategies and difficulties
- Detection of unused story seeds
- Flagging of balance outliers and anomalies
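Outlier flagging of this kind can be illustrated with a simple z-score pass over per-strategy win rates. This is an illustrative heuristic with invented numbers, not the analyzer's actual method.

```python
from statistics import mean, stdev

# Invented per-strategy win rates, as the analysis report might tabulate.
win_rates = {"balanced": 0.34, "aggressive": 0.33, "diplomatic": 0.05, "hybrid": 0.28}

def flag_outliers(rates, threshold=1.2):
    """Flag strategies whose win rate sits far from the mean (|z| > threshold)."""
    mu = mean(rates.values())
    sigma = stdev(rates.values())
    return [name for name, rate in rates.items() if abs(rate - mu) / sigma > threshold]

print(flag_outliers(win_rates))  # ['diplomatic']
```

Here `diplomatic` wins so rarely that it sits well outside the spread of the other strategies, which is exactly the kind of anomaly worth investigating in authored content or AI scoring.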
## Balance Iteration Workflow

1. **Initial Exploration:** Run batch sweeps with diverse parameter combinations to establish baseline metrics.
2. **Tournament Validation:** Run focused tournaments on specific strategy combinations.
3. **Analysis:** Use the analysis script to identify dominant strategies, underpowered or overpowered actions, and unused content.
4. **Adjustment:** Modify simulation parameters or authored content based on findings.
5. **Regression Testing:** Re-run batch sweeps to validate improvements and ensure no regressions.
## CI Integration

A nightly CI workflow automatically runs tournaments and batch sweeps, archiving results for ongoing balance review. See `.github/workflows/ai-tournament.yml` for details.
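A nightly setup might look roughly like the sketch below. The repository's `.github/workflows/ai-tournament.yml` is authoritative; the schedule and step details here are illustrative assumptions, with commands taken from the examples above.

```yaml
# Illustrative sketch only; see .github/workflows/ai-tournament.yml for the real workflow.
name: nightly-ai-tournament
on:
  schedule:
    - cron: "0 3 * * *"  # Hypothetical nightly time
jobs:
  tournament:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - run: uv run python scripts/run_ai_tournament.py --games 100 --output build/tournament.json
      - run: uv run python scripts/run_batch_sweeps.py --output-dir build/sweeps
      - uses: actions/upload-artifact@v4
        with:
          name: tournament-results
          path: build/
```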
## Usage Tips

- Use different world configs and seeds to stress-test balance across scenarios.
- For large parameter spaces, start with `sampling.mode: random` and a reduced `sample_count`.
- Review the analysis report regularly to guide design iteration.
- Archived CI artifacts provide a historical record of balance changes.
- Use `--verbose` during development to monitor sweep progress.
- Use reproducible seeds for regression testing.

## See Also

- [How to Play Echoes](./how_to_play_echoes.md)
