dCDH by_path Wave 5 #11: + heterogeneity (predict_het per-by_level) by igerber · Pull Request #412 · igerber/diff-diff

igerber · 2026-05-10T16:45:02Z

Summary

Lift gate at chaisemartin_dhaultfoeuille.py:1230-1234 so per-path event-study disaggregation composes with heterogeneity="<col>".
New _compute_path_heterogeneity_test helper + path_groups: Optional[Set[int]] filter on _compute_heterogeneity_test. Per-path beta / SE / inference computed on the path-restricted switcher subsample; cohort dummies absorb baseline by construction (no R-divergence warning needed).
Result schema: new path_heterogeneity_effects: Dict[Tuple[int, ...], Dict[int, Dict[str, Any]]] field; to_dataframe(level="by_path") extended with always-present het_* columns (NaN for placebo rows / when not requested).
R parity: introduces the FIRST predict_het parity baseline in the repo. Two new R generator scenarios (multi_path_reversible_predict_het global anchor + multi_path_reversible_by_path_predict_het per-path) using dont_drop_larger_lower=TRUE to match Python's drop_larger_lower=False requirement under reversal paths. Per-path beta / SE match R within rtol=1e-6.
Inference refresh hooks into the final R2 P1b block (per feedback_late_if_site_inference_refresh.md) so all per-path entries see the final df_survey after replicate-weight n_valid appends.
Multiplier bootstrap (n_bootstrap > 0) under by_path + heterogeneity + survey_design inherits the existing per-path multiplier-bootstrap-survey gate from PR Compose by_path / paths_of_interest with survey_design (Wave 4 #10) #408.

Methodology references (required if estimator / math changes)

Method name(s): de Chaisemartin & D'Haultfœuille reversible-treatment heterogeneity test (Web Appendix Section 1.5, Lemma 7) under per-path event-study disaggregation
Paper / source link(s): de Chaisemartin & D'Haultfœuille (2024) "Difference-in-Differences Estimators of Intertemporal Treatment Effects" — Web Appendix Section 1.5 (heterogeneity test); R did_multiplegt_dyn(..., by_path, predict_het) per-by_level dispatch (R/R/did_multiplegt_dyn.R:226-257)
Any intentional deviations from the source (and why): None on the per-path methodology — Python mirrors R's per-path predict_het exactly. The inherited cross-path cohort-sharing SE deviation from R for the OVERALL path_effects SE is unchanged and does not apply here (heterogeneity uses standard WLS coefficient IF, not the cohort-recentered IF allocator). R's predict_het dont_drop_larger_lower=TRUE flag is matched in fixture scenarios so reversal paths preserve cohort variation under heterogeneity testing.

Validation

Tests added/updated:
- tests/test_chaisemartin_dhaultfoeuille.py::TestByPathHeterogeneity — 12 tests across gate dispatch (5), behavior (4 incl. single-path telescope at atol=rtol=1e-14, zero-signal anti-regression, multi-baseline UserWarning anti-regression), DataFrame integration (1), edge cases (2)
- tests/test_chaisemartin_dhaultfoeuille_parity.py::TestDCDHDynRParityHeterogeneity — global anchor (FIRST predict_het parity baseline)
- tests/test_chaisemartin_dhaultfoeuille_parity.py::TestDCDHDynRParityByPathHeterogeneity — per-path
- benchmarks/R/generate_dcdh_dynr_test_values.R — 2 new scenarios (Add comprehensive test coverage for utils module #20 + Update roadmap with current implementation limitations #21), 2 new extraction helpers (extract_dcdh_predict_het, extract_dcdh_by_path_predict_het)
- benchmarks/data/dcdh_dynr_golden_values.json regenerated (21 scenarios, 619 KB)
Backtest / simulation / notebook evidence (if applicable): N/A (additive estimator extension; full R parity in fixture suite)

Security / privacy

Confirm no secrets/PII in this PR: Yes

Generated with Claude Code

github-actions · 2026-05-10T16:49:13Z

PR Review Report

Overall Assessment

⚠️ Needs changes: one unmitigated P1 finding from the required sibling-surface mirror audit.

Executive Summary

Affected method: ChaisemartinDHaultfoeuille dCDH reversible-treatment heterogeneity test, Web Appendix Section 1.5 / Lemma 7, under by_path and paths_of_interest.
Methodology registry now documents the per-path heterogeneity contract and the R-parity scope; I did not find an undocumented math or SE deviation in the estimator path.
Inference uses safe_inference() on the new path; pattern-wide grep found no new inline t_stat = effect / se anti-pattern.
P1: the new path_heterogeneity_effects surface is wired into the result dataclass and to_dataframe(level="by_path"), but not into summary(), unlike sibling per-path surfaces and global heterogeneity.
Could not run tests: pytest is not installed in this environment.

Methodology

Finding M-1: P3 Informational — documented partial/deviation surface, no action required

Severity: P3
Impact: The global heterogeneity implementation remains a documented partial predict_het port: post-treatment regressions only, no placebo regressions or joint null test. This is explicitly documented in docs/methodology/REGISTRY.md:L637, so it is not a defect under the review rules. The new per-path extension is also documented in the by-path note, including path-restricted switchers, unchanged variance machinery, survey composition, final df_survey refresh, and result schema at docs/methodology/REGISTRY.md:L643.
Concrete fix: None required.

Code Quality

Finding CQ-1: P1 — `summary()` does not render the new per-path heterogeneity surface

Severity: P1
Impact: The PR adds results.path_heterogeneity_effects and exposes it in to_dataframe(level="by_path"), but the sibling report surface is missing. Global heterogeneity has a dedicated summary renderer at diff_diff/chaisemartin_dhaultfoeuille_results.py:L1253-L1283, and existing per-path sibling surfaces render placebos, cumulated effects, and sup-t bands inside _render_path_effects_section at diff_diff/chaisemartin_dhaultfoeuille_results.py:L1308-L1431. The new heterogeneity field is only consumed by the DataFrame path at diff_diff/chaisemartin_dhaultfoeuille_results.py:L1823-L1906. This violates the required sibling-surface mirror audit (schema ↔ renderer, summary ↔ full) and makes summary() silently omit a computed public result.
Concrete fix: Extend _render_path_effects_section() to read self.path_heterogeneity_effects.get(path, {}) and render a per-path heterogeneity sub-block with beta, se, t_stat, and p_value, mirroring _render_heterogeneity_section(). Add a regression test asserting res.summary() contains the per-path heterogeneity block and path/horizon rows when by_path + heterogeneity is fit.

Performance

No P0/P1/P2 findings.

Maintainability

Finding MA-1: P1 — same as CQ-1

Severity: P1
Impact: The result schema and renderer are now out of sync for one public surface, increasing the chance future users or downstream tests treat summary() as complete when it is not.
Concrete fix: Same as CQ-1.

Tech Debt

No mitigating TODO entry found under TODO.md “Tech Debt from Code Reviews” for the missing summary surface. The P1 remains unmitigated.

Security

No findings.

Documentation/Tests

Finding DT-1: P2 — summary coverage missing for the new public surface

Severity: P2
Impact: The new tests cover gate dispatch, behavior, DataFrame integration, and R parity, but do not cover summary() for path_heterogeneity_effects. Existing sibling features have summary tests, so this gap allowed CQ-1 through.
Concrete fix: Add a TestByPathHeterogeneity::test_per_path_heterogeneity_renders_in_summary test.

Verification note: I could not run the targeted tests because pytest is not installed (/bin/bash: pytest: command not found). I did run static greps for the inference anti-pattern and sibling surfaces.

Path to Approval

Add summary rendering for path_heterogeneity_effects inside _render_path_effects_section().
Add a regression test that fits by_path + heterogeneity and asserts summary() includes the per-path heterogeneity rows.
Run the new test plus the existing TestByPathHeterogeneity and dCDH parity heterogeneity tests in an environment with pytest installed.

Extends `_render_path_effects_section` in chaisemartin_dhaultfoeuille_results.py to render a per-path "Heterogeneity Test (Section 1.5, partial)" sub-block when `path_heterogeneity_effects` is populated for the path. Mirrors the global `_render_heterogeneity_section` block scoped to each per-path section, alongside the existing per-path placebo / cumulated / sup-t blocks. Adds anti-regression test `TestByPathHeterogeneity::test_per_path_heterogeneity_renders_in_summary` that asserts both the section header and at least one rendered beta value appear in `summary()` output when `by_path + heterogeneity` is fit. Locks the schema↔renderer mirror so future per-path surfaces don't silently omit from `summary()`. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

igerber · 2026-05-10T16:53:35Z

/ai-review

github-actions · 2026-05-10T16:58:32Z

🔁 AI review rerun (requested by @igerber)

Head SHA: 7f7e3d5bec9fb80aadcb80f66f0fb4a408514077

Overall Assessment

✅ Looks good — no unmitigated P0 or P1 findings in this re-review.

Executive Summary

Prior P1 is resolved: summary() now renders per-path heterogeneity rows at diff_diff/chaisemartin_dhaultfoeuille_results.py:L1421-L1443, with a regression test at tests/test_chaisemartin_dhaultfoeuille.py:L10331-L10370.
Affected method: dCDH reversible-treatment heterogeneity test, Web Appendix Section 1.5 / Lemma 7, under by_path / paths_of_interest.
Methodology is documented in docs/methodology/REGISTRY.md:L637 and docs/methodology/REGISTRY.md:L643; I found no undocumented estimator, weighting, or SE deviation.
New inference paths use safe_inference() and preserve NaN-consistent inference fields.
Completed sibling-surface, pattern-wide grep, and by_path ↔ paths_of_interest symmetry audits. No workflow paths: changes were in scope.
I could not run tests locally because pytest is unavailable in this environment.

Methodology

M-1: P3 Informational — documented partial R `predict_het` surface

Severity: P3
Impact: The global heterogeneity implementation remains a documented partial port: post-treatment regressions only, without R’s placebo regressions or joint null test. This is explicitly documented as a deviation in docs/methodology/REGISTRY.md:L637, so it is not a defect under the review rules. The new per-path extension is also documented, including path-restricted switchers, variance machinery, survey composition, df refresh, and result schema at docs/methodology/REGISTRY.md:L643.
Concrete fix: None required.

No P0/P1 methodology findings. The new per-path helper restricts eligibility through path_groups inside _compute_heterogeneity_test at diff_diff/chaisemartin_dhaultfoeuille.py:L5025-L5027, and dispatches per selected path at diff_diff/chaisemartin_dhaultfoeuille.py:L6459-L6535.

Code Quality

No P0/P1/P2 findings.

The prior missing renderer is fixed in the sibling report surface at diff_diff/chaisemartin_dhaultfoeuille_results.py:L1421-L1443. Pattern-wide grep found no new inline t_stat = effect / se inference anti-pattern in the changed dCDH path; the new and refreshed fields route through safe_inference() at diff_diff/chaisemartin_dhaultfoeuille.py:L5106-L5223 and diff_diff/chaisemartin_dhaultfoeuille.py:L4058-L4070.

Performance

No findings.

Maintainability

No P0/P1/P2 findings.

Sibling surfaces are now aligned: result schema at diff_diff/chaisemartin_dhaultfoeuille_results.py:L597-L607, summary rendering at diff_diff/chaisemartin_dhaultfoeuille_results.py:L1421-L1443, and to_dataframe(level="by_path") het_* columns at diff_diff/chaisemartin_dhaultfoeuille_results.py:L1781-L1932.

Tech Debt

No untracked blocking tech debt found for this PR. Existing TODO.md review debt entries appear unrelated to the new per-path heterogeneity surface.

Security

No findings. A lightweight secret-pattern scan over the changed text did not find candidate secrets.

Documentation/Tests

DT-1: P3 Informational — local test execution unavailable

Severity: P3
Impact: I could not execute the targeted tests because this environment lacks pytest (/usr/bin/python: No module named pytest). Static review shows coverage was added for gates, behavior, DataFrame output, summary rendering, and R parity at tests/test_chaisemartin_dhaultfoeuille.py:L9977-L10370 and tests/test_chaisemartin_dhaultfoeuille_parity.py:L1324-L1472.
Concrete fix: Run the targeted dCDH heterogeneity and parity tests in CI or a full dev environment with test dependencies installed.

igerber · 2026-05-10T18:38:08Z

/ai-review

github-actions · 2026-05-10T18:41:20Z

🔁 AI review rerun (requested by @igerber)

Head SHA: 6c8a68ca40f85d2b2cbc6155e5ad0c9e5149de0f

Overall Assessment

✅ Looks good — no unmitigated P0/P1 findings in this re-review.

Executive Summary

Prior P1 is resolved: summary() now renders per-path heterogeneity rows at diff_diff/chaisemartin_dhaultfoeuille_results.py:L1421-L1443, with coverage at tests/test_chaisemartin_dhaultfoeuille.py:L10534-L10572.
Affected method: dCDH reversible-treatment heterogeneity test, Web Appendix Section 1.5 / Lemma 7, now under by_path and paths_of_interest.
Methodology is documented in docs/methodology/REGISTRY.md:L637 and docs/methodology/REGISTRY.md:L643; I found no undocumented estimator, weighting, or SE deviation.
New inference paths use safe_inference() and refresh replicate-weight df for per-path heterogeneity at diff_diff/chaisemartin_dhaultfoeuille.py:L4058-L4070.
Completed sibling-surface, pattern-wide grep, by_path ↔ paths_of_interest symmetry, and workflow-scope audits. No GH Actions workflow paths: changes are in scope.
I could not run tests locally because this environment lacks numpy, pandas, and pytest.

Methodology

M-1: P3 Informational — documented partial R `predict_het` surface

Severity: P3
Impact: The global heterogeneity implementation remains a documented partial port: post-treatment regressions only, without R’s placebo regressions or joint null test. This is explicitly documented in docs/methodology/REGISTRY.md:L637, so it is not a defect. The new per-path extension is documented at docs/methodology/REGISTRY.md:L643.
Concrete fix: None required.

No P0/P1 methodology findings. The path restriction is implemented through path_groups in _compute_heterogeneity_test() at diff_diff/chaisemartin_dhaultfoeuille.py:L5025-L5027 and dispatched by _compute_path_heterogeneity_test() at diff_diff/chaisemartin_dhaultfoeuille.py:L6459-L6535.

Code Quality

No P0/P1/P2 findings.

The new result schema is wired into the result dataclass at diff_diff/chaisemartin_dhaultfoeuille_results.py:L597-L607, summary rendering at L1421-L1443, and to_dataframe(level="by_path") at L1819-L1937.

Performance

No findings.

Maintainability

No P0/P1/P2 findings.

Sibling surfaces are aligned: estimator output, summary rendering, DataFrame columns, docs, registry, and parity tests all reference the same {path: {l: ...}} path_heterogeneity_effects contract.

Tech Debt

No untracked blocking tech debt found for this PR. Existing TODO/registry limitations are unrelated or explicitly documented.

Security

No findings. A lightweight secret-pattern scan over the changed files found no candidate secrets.

Documentation/Tests

DT-1: P3 Informational — local test execution unavailable

Severity: P3
Impact: I could not execute the targeted tests because this environment lacks required test dependencies. Static review shows coverage for gates, behavior, summary, DataFrame output, survey composition, and R parity.
Concrete fix: Run the targeted dCDH heterogeneity and parity tests in CI or a full dev environment.

DT-2: P3 Informational — multi-baseline no-warning test is not actually multi-baseline

Severity: P3
Impact: tests/test_chaisemartin_dhaultfoeuille.py:L10276-L10302 claims to cover the multi-baseline no-warning anti-regression, but _by_path_het_data() uses paths that all start from baseline 0. This does not block approval because the behavior is documented and the methodology path is not silently wrong.
Concrete fix: Add a small fixture with switchers on at least two baseline values and assert no multi-baseline warning under by_path + heterogeneity.

Lifts the gate at chaisemartin_dhaultfoeuille.py:1230-1234 so per-path event-study disaggregation composes with heterogeneity="<col>" (Web Appendix Section 1.5, Lemma 7), mirroring R did_multiplegt_dyn(..., by_path, predict_het) per-by_level dispatch. Per-path heterogeneity is computed by re-running the Lemma 7 regression on each path-restricted switcher subsample. New `path_groups` (Optional[Set[int]]) parameter on _compute_heterogeneity_test restricts eligibility to switchers ON path p; the variance machinery (standard WLS vcov for non-survey, cell-period IF allocator for Binder TSL, group-level allocator for Rao-Wu replicate) is unchanged from the global heterogeneity path. Cohort dummies absorb baseline by construction, so multi-baseline switcher panels do not produce R-divergence (no parallel UserWarning like controls / trends_linear). Surfaces on results.path_heterogeneity_effects keyed {path: {l: {beta, se, t_stat, p_value, conf_int, n_obs}}} and on to_dataframe(level="by_path") via new always-present het_* columns, populated for positive horizons and NaN otherwise (mirrors cband_* / cumulated_* convention). Per-(path, horizon) inference is refreshed in the final R2 P1b block so all surfaces use the same df_survey after replicate-weight n_valid appends. R parity: introduces the FIRST predict_het R-parity baseline in the repo. Two new scenarios (multi_path_reversible_predict_het global anchor + multi_path_reversible_by_path_predict_het per-path) use dont_drop_larger_lower=TRUE to match drop_larger_lower=False and provide cohort variation under reversal paths. Per-path beta and SE match R within rtol=1e-6. Multiplier bootstrap (n_bootstrap > 0) under by_path + heterogeneity + survey_design inherits the existing per-path multiplier-bootstrap gate from PR #408. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Extends `_render_path_effects_section` in chaisemartin_dhaultfoeuille_results.py to render a per-path "Heterogeneity Test (Section 1.5, partial)" sub-block when `path_heterogeneity_effects` is populated for the path. Mirrors the global `_render_heterogeneity_section` block scoped to each per-path section, alongside the existing per-path placebo / cumulated / sup-t blocks. Adds anti-regression test `TestByPathHeterogeneity::test_per_path_heterogeneity_renders_in_summary` that asserts both the section header and at least one rendered beta value appear in `summary()` output when `by_path + heterogeneity` is fit. Locks the schema↔renderer mirror so future per-path surfaces don't silently omit from `summary()`. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Adds 3 slow regression tests for `by_path + heterogeneity + survey_design` composition that the original PR claimed via REGISTRY/CHANGELOG but lacked automated coverage: 1. `test_per_path_heterogeneity_under_survey_finite` — analytical Binder TSL SE finite per (path, l) under PSU=group survey design. 2. `test_per_path_heterogeneity_replicate_weights_propagates_n_valid` — BRR replicate-weight fit drives `_replicate_n_valid_list` through per-(path, l) heterogeneity calls and final `survey_metadata.df_survey == n_replicates - 1`. 3. `test_survey_design_plus_n_bootstrap_with_heterogeneity_still_raises` — confirms heterogeneity composition does not accidentally re-route around the existing per-path multiplier-bootstrap-survey gate. Also extends `to_dataframe(level="by_path")` docstring at `chaisemartin_dhaultfoeuille_results.py:1527` to list the new `het_*` columns alongside `cband_*` and `cumulated_*`, mirroring the always-present NaN-when-None convention. The empty-DataFrame schema already includes them; this is a docstring-only sync. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

igerber · 2026-05-11T00:56:04Z

/ai-review

github-actions · 2026-05-11T00:57:35Z

🔁 AI review rerun (requested by @igerber)

Head SHA: a57f5834601f5b31e7561a7db1e45094acd3323b

Overall Assessment

⚠️ Needs changes — one unmitigated P2 claim-vs-test mismatch remains.

Executive Summary

Affected method: ChaisemartinDHaultfoeuille dCDH heterogeneity test (Web Appendix Section 1.5 / Lemma 7), newly composed with by_path / paths_of_interest.
Methodology implementation is broadly consistent with the visible REGISTRY contract: per-path heterogeneity filters switchers by selected path and reuses the existing Lemma 7 regression / inference path.
Prior summary-rendering concern appears resolved: summary() now renders per-path heterogeneity rows.
Public surfaces are mostly wired: result dataclass field, summary(), and to_dataframe(level="by_path") all expose path_heterogeneity_effects / het_*.
Remaining blocker is a test-claim mismatch: the PR/CHANGELOG/REGISTRY claim multi-baseline per-path heterogeneity behavior is regression-tested, but the visible “multi-baseline” test fixture uses only baseline-0 switcher paths.

Methodology

M-1 — P3 Informational: documented partial heterogeneity implementation

Severity: P3
Impact: The dCDH heterogeneity test remains a partial implementation relative to R / the paper: post-treatment regressions only, no placebo regressions or joint null test. This is explicitly documented in the Methodology Registry under the dCDH heterogeneity notes, so it is not a defect.
Concrete fix: None required.

M-2 — No unmitigated methodology mismatch found in implementation

Severity: N/A
Impact: The new implementation path matches the visible REGISTRY claim for per-path heterogeneity: _compute_path_heterogeneity_test() enumerates selected paths and calls _compute_heterogeneity_test(..., path_groups=...), while _compute_heterogeneity_test() restricts eligible switchers via if path_groups is not None and g not in path_groups: continue (diff_diff/chaisemartin_dhaultfoeuille.py:_compute_heterogeneity_test, _compute_path_heterogeneity_test). This is consistent with the documented “path-restricted switcher subsample” contract.
Concrete fix: None.

Code Quality

No P0/P1/P2 code-quality findings in the visible changed source.

The new field is threaded through:

Fit result construction: diff_diff/chaisemartin_dhaultfoeuille.py near path_heterogeneity_effects=...
Result dataclass: diff_diff/chaisemartin_dhaultfoeuille_results.py near path_heterogeneity_effects
Summary rendering: ChaisemartinDHaultfoeuilleResults._render_path_effects_section
DataFrame rendering: ChaisemartinDHaultfoeuilleResults.to_dataframe(level="by_path")

Performance

No findings.

The path heterogeneity helper re-enumerates treatment paths after path_effects already did so. This is documented in-code and is likely acceptable given the feature scope; no performance blocker.

Maintainability

No unmitigated maintainability findings.

The schema shape {path: {l: {...}}} is documented in source docstrings and handled separately from path_effects[path]["horizons"], which avoids overloading the existing path-effect schema.

Tech Debt

No untracked blocking tech debt found in the visible context.

Security

No findings in the visible diff. I did not run a secret scanner or shell grep; this is based only on the loaded prompt content.

Documentation/Tests

DT-1 — P2: claimed multi-baseline behavior is not actually tested

Severity: P2
Impact: The PR/CHANGELOG/REGISTRY explicitly claim that per-path heterogeneity on multi-baseline switcher panels does not produce the R-divergence warning used for controls / trends_linear, because cohort dummies absorb baseline. The visible test named test_per_path_heterogeneity_no_multi_baseline_warning does not actually construct a multi-baseline switcher panel: _by_path_het_data() defines paths [(0, 1, 1, 1), (0, 1, 0, 0), (0, 1, 1, 0)], all starting from baseline 0 (tests/test_chaisemartin_dhaultfoeuille.py:_by_path_het_data). As a result, the shipped test does not cover the explicitly claimed multi-baseline behavior.

This is a claim-vs-test mismatch under the single-pass audit rules. No TODO mitigation is visible in the prompt, and shipped-behavior test gaps are P2.

Concrete fix: Add or correct a test with at least two switcher baseline values, e.g.:

Include baseline-0 joiner paths such as (0, 1, 1, 1).
Include baseline-1 leaver paths such as (1, 0, 0, 0) or another valid reversible path beginning with 1.
Fit with ChaisemartinDHaultfoeuille(drop_larger_lower=False, by_path=2) or paths_of_interest=[(0,1,1,1), (1,0,0,0)], L_max=3, heterogeneity="het_x".
Assert:
1. res.path_heterogeneity_effects contains both selected paths or the expected observed subset.
2. No warning matching the controls/trends-linear multi-baseline divergence text is emitted.
3. At least one per-path heterogeneity entry is finite where the design has sufficient observations.

Path to Approval

Add a real multi-baseline per-path heterogeneity test as described in DT-1, using switcher paths with different first elements / baseline treatment values.
Ensure the new test filters out unrelated expected warnings, but fails on any warning containing the multi-baseline R-divergence language used by by_path + controls / by_path + trends_linear.
Re-run the targeted dCDH heterogeneity tests and parity tests in CI.

igerber added ready-for-ci Triggers CI test workflows and removed ready-for-ci Triggers CI test workflows labels May 10, 2026

igerber mentioned this pull request May 10, 2026

Replace CI Codex agent with single-shot Responses API + tighten verdict bar #415

Merged

igerber and others added 3 commits May 10, 2026 20:55

igerber force-pushed the dcdh-by-path-heterogeneity branch from 6c8a68c to a57f583 Compare May 11, 2026 00:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dCDH by_path Wave 5 #11: + heterogeneity (predict_het per-by_level)#412

dCDH by_path Wave 5 #11: + heterogeneity (predict_het per-by_level)#412
igerber wants to merge 3 commits intomainfrom
dcdh-by-path-heterogeneity

igerber commented May 10, 2026

Uh oh!

github-actions Bot commented May 10, 2026

Uh oh!

igerber commented May 10, 2026

Uh oh!

github-actions Bot commented May 10, 2026

Uh oh!

igerber commented May 10, 2026

Uh oh!

github-actions Bot commented May 10, 2026

Uh oh!

igerber commented May 11, 2026

Uh oh!

github-actions Bot commented May 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

igerber commented May 10, 2026

Summary

Methodology references (required if estimator / math changes)

Validation

Security / privacy

Uh oh!

github-actions Bot commented May 10, 2026

PR Review Report

Overall Assessment

Executive Summary

Methodology

Finding M-1: P3 Informational — documented partial/deviation surface, no action required

Code Quality

Finding CQ-1: P1 — summary() does not render the new per-path heterogeneity surface

Performance

Maintainability

Finding MA-1: P1 — same as CQ-1

Tech Debt

Security

Documentation/Tests

Finding DT-1: P2 — summary coverage missing for the new public surface

Path to Approval

Uh oh!

igerber commented May 10, 2026

Uh oh!

github-actions Bot commented May 10, 2026

Overall Assessment

Executive Summary

Methodology

M-1: P3 Informational — documented partial R predict_het surface

Code Quality

Performance

Maintainability

Tech Debt

Security

Documentation/Tests

DT-1: P3 Informational — local test execution unavailable

Uh oh!

igerber commented May 10, 2026

Uh oh!

github-actions Bot commented May 10, 2026

Overall Assessment

Executive Summary

Methodology

M-1: P3 Informational — documented partial R predict_het surface

Code Quality

Performance

Maintainability

Tech Debt

Security

Documentation/Tests

DT-1: P3 Informational — local test execution unavailable

DT-2: P3 Informational — multi-baseline no-warning test is not actually multi-baseline

Uh oh!

igerber commented May 11, 2026

Uh oh!

github-actions Bot commented May 11, 2026

Overall Assessment

Executive Summary

Methodology

M-1 — P3 Informational: documented partial heterogeneity implementation

M-2 — No unmitigated methodology mismatch found in implementation

Code Quality

Performance

Maintainability

Tech Debt

Security

Documentation/Tests

DT-1 — P2: claimed multi-baseline behavior is not actually tested

Path to Approval

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Finding CQ-1: P1 — `summary()` does not render the new per-path heterogeneity surface

M-1: P3 Informational — documented partial R `predict_het` surface

M-1: P3 Informational — documented partial R `predict_het` surface