Address PR #360 AI review round 3: frequency_rank assert + scenario 14 doc cleanup

igerber · claude · igerber · commit daefda78e347 · 2026-04-24T13:49:20.000-04:00
Two findings (1 P2 + 1 P3):

- P2: `_compare_by_path` now asserts `py_path["frequency_rank"] ==
  r_path_entry["frequency_rank"]` for every committed path. Both
  scenarios are constructed with unique path frequencies (scenario 13
  via the mixed_single_switch pattern, scenario 14 via deterministic
  counts 40/25/10/5), so rank ordering is unambiguous and any
  regression in top-k tiebreak handling now fails explicitly instead
  of passing silently as long as the selected path set and per-path
  effects remain correct.

- P3: scenario 14 generator docstring and recorded params still
  described the old stochastic `p_switch`-driven DGP (the pre-PR
  variant that blew out SE parity via cross-path cohort mixing). The
  `multi_path_reversible` pattern is now DETERMINISTIC: path
  assignment is a fixed function of F_g with counts 20/20/15/10/10/5
  across the 6 F_g values. `p_switch = 0.35` dropped from both the
  scenario call and the `params` block in the fixture; comment block
  rewritten to describe the deterministic design and cite the REGISTRY
  note for the rationale behind the design choice.

Fixture regenerated; scenario 14 params no longer carry the stale
`p_switch` entry. Point and SE parity numbers unchanged
(deterministic DGP produces the same treatment matrix as before).
Tests pass (2/2); ruff clean.

Co-Authored-By: Claude Opus 4.7 (1M context) &lt;noreply@anthropic.com&gt;
diff --git a/benchmarks/R/generate_dcdh_dynr_test_values.R b/benchmarks/R/generate_dcdh_dynr_test_values.R
@@ -618,14 +618,20 @@ scenarios$mixed_single_switch_by_path <- list(
 )
 
 # Scenario 14: multi_path_reversible + by_path=3 (top-k ranking case).
-# The multi_path_reversible DGP produces 3+ distinct observed paths via
-# random post-switch toggling. by_path=3 exercises the top-k selection
-# when observed paths exceed k. n_periods=10 gives every switch_time a
-# complete length-(L_max+1) window.
+# The `multi_path_reversible` pattern is a DETERMINISTIC multi-path DGP:
+# path assignment is a fixed function of F_g (so every (D_{g,1}, F_g,
+# S_g) cohort contains switchers from a single path), path proportions
+# are fixed at 20/20/15/10/10/5 across the 6 F_g values, and
+# post-window treatment is stable at path[L_max+1]. by_path=3 exercises
+# top-k selection when observed paths exceed k (4 observed paths, top-3
+# selected). n_periods=10 gives every switch_time a complete length-
+# (L_max+1) window. The old `p_switch`-driven random-toggle variant
+# (pre-PR) blew out SE parity with R via cross-path cohort mixing;
+# see the REGISTRY.md `Note (Phase 3 by_path ...)` Deviation bullet.
 cat("  Scenario 14: multi_path_reversible_by_path\n")
 d14 <- gen_reversible(n_groups = N_GOLDEN, n_periods = 10,
                       pattern = "multi_path_reversible", seed = 114,
-                      p_switch = 0.35, L_max = 3)
+                      L_max = 3)
 res14 <- did_multiplegt_dyn(
   df = d14, outcome = "outcome", group = "group", time = "period",
   treatment = "treatment", effects = 3, by_path = 3, ci_level = 95
@@ -634,7 +640,7 @@ scenarios$multi_path_reversible_by_path <- list(
   data = export_data(d14),
   params = list(pattern = "multi_path_reversible", n_groups = N_GOLDEN,
                 n_periods = 10, seed = 114, effects = 3, by_path = 3,
-                ci_level = 95, p_switch = 0.35),
+                ci_level = 95),
   results = extract_dcdh_by_path(res14, n_effects = 3)
 )
 
diff --git a/benchmarks/data/dcdh_dynr_golden_values.json b/benchmarks/data/dcdh_dynr_golden_values.json
@@ -660,8 +660,7 @@
         "seed": 114,
         "effects": 3,
         "by_path": 3,
-        "ci_level": 95,
-        "p_switch": 0.35
+        "ci_level": 95
       },
       "results": {
         "by_path": [
diff --git a/tests/test_chaisemartin_dhaultfoeuille_parity.py b/tests/test_chaisemartin_dhaultfoeuille_parity.py
@@ -545,6 +545,20 @@ def _compare_by_path(self, scenario, by_path, L_max, point_rtol, se_rtol):
         for r_path_entry in r_by_path:
             path_key = self._path_key_from_r_label(r_path_entry["path"])
             py_path = results.path_effects[path_key]
+
+            # Assert the public frequency_rank contract matches R. Both
+            # committed scenarios are constructed with unique path
+            # frequencies (scenario 13 via mixed_single_switch pattern,
+            # scenario 14 via deterministic counts 40/25/10/5) so rank
+            # ordering is unambiguous and must agree; a regression in
+            # path ranking or top-k tiebreak handling should fail here
+            # even if the selected path set and per-path effects remain
+            # correct.
+            assert py_path["frequency_rank"] == r_path_entry["frequency_rank"], (
+                f"path={path_key}: frequency_rank mismatch "
+                f"py={py_path['frequency_rank']} vs r={r_path_entry['frequency_rank']}"
+            )
+
             for h_str, r_h in r_path_entry["horizons"].items():
                 h = int(h_str)
                 assert (