Skip to content

Commit e7d898a

Browse files
authored
Merge pull request #190 from igerber/docs/update-roadmap
Update roadmap for v2.6.0
2 parents 290b13a + 44edc6b commit e7d898a

1 file changed

Lines changed: 37 additions & 68 deletions

File tree

ROADMAP.md

Lines changed: 37 additions & 68 deletions
Original file line numberDiff line numberDiff line change
@@ -8,9 +8,9 @@ For past changes and release history, see [CHANGELOG.md](CHANGELOG.md).
88

99
## Current Status
1010

11-
diff-diff v2.4.1 is a **production-ready** DiD library with feature parity with R's `did` + `HonestDiD` + `synthdid` ecosystem for core DiD analysis:
11+
diff-diff v2.6.0 is a **production-ready** DiD library with feature parity with R's `did` + `HonestDiD` + `synthdid` ecosystem for core DiD analysis:
1212

13-
- **Core estimators**: Basic DiD, TWFE, MultiPeriod, Callaway-Sant'Anna, Sun-Abraham, Borusyak-Jaravel-Spiess Imputation, Synthetic DiD, Triple Difference (DDD), TROP, Two-Stage DiD (Gardner 2022), Stacked DiD (Wing et al. 2024)
13+
- **Core estimators**: Basic DiD, TWFE, MultiPeriod, Callaway-Sant'Anna, Sun-Abraham, Borusyak-Jaravel-Spiess Imputation, Synthetic DiD, Triple Difference (DDD), TROP, Two-Stage DiD (Gardner 2022), Stacked DiD (Wing et al. 2024), Continuous DiD (Callaway, Goodman-Bacon & Sant'Anna 2024)
1414
- **Valid inference**: Robust SEs, cluster SEs, wild bootstrap, multiplier bootstrap, placebo-based variance
1515
- **Assumption diagnostics**: Parallel trends tests, placebo tests, Goodman-Bacon decomposition
1616
- **Sensitivity analysis**: Honest DiD (Rambachan-Roth), Pre-trends power analysis (Roth 2022)
@@ -20,35 +20,15 @@ diff-diff v2.4.1 is a **production-ready** DiD library with feature parity with
2020

2121
---
2222

23-
## Near-Term Enhancements (v2.5)
24-
25-
High-value additions building on our existing foundation.
26-
27-
### ~~Stacked Difference-in-Differences~~ (Implemented in v2.5)
28-
29-
Implemented as `StackedDiD`. See `diff_diff/stacked_did.py`.
23+
## Near-Term Enhancements (v2.7)
3024

3125
### Staggered Triple Difference (DDD)
3226

33-
Extend the existing `TripleDifference` estimator to handle staggered adoption settings. The current implementation handles 2-period DDD; this extends to multi-period designs.
27+
Extend the existing `TripleDifference` estimator to handle staggered adoption settings.
3428

35-
**Multi-period/Staggered Support:**
3629
- Group-time ATT(g,t) for DDD designs with variation in treatment timing
37-
- Handle settings where groups adopt at different times
38-
- Multiple comparison groups (never-treated, not-yet-treated in either dimension)
39-
- `StaggeredTripleDifference` class or extended `TripleDifference` with `first_treat` parameter
40-
41-
**Event Study Aggregation:**
42-
- Dynamic treatment effects over time (event study coefficients)
43-
- Pre-treatment placebo effects for parallel trends assessment
44-
- `aggregate='event_study'` parameter like `CallawaySantAnna`
45-
- Integration with `plot_event_study()` visualization
46-
47-
**Multiplier Bootstrap Inference:**
30+
- Event study aggregation and pre-treatment placebo effects
4831
- Multiplier bootstrap for valid inference in staggered settings
49-
- Rademacher, Mammen, and Webb weight options (matching existing estimators)
50-
- `n_bootstrap` parameter and `DDDBootstrapResults` class
51-
- Clustered bootstrap for panel data
5232

5333
**Reference**: [Ortiz-Villavicencio & Sant'Anna (2025)](https://arxiv.org/abs/2505.09942). *Working Paper*. R package: `triplediff`.
5434

@@ -60,22 +40,13 @@ Extend the existing `TripleDifference` estimator to handle staggered adoption se
6040

6141
---
6242

63-
## Medium-Term Enhancements (v2.5+)
64-
65-
Extending diff-diff to handle more complex settings.
43+
## Medium-Term Enhancements
6644

67-
### Continuous Treatment DiD
45+
### Efficient DiD Estimators
6846

69-
Many treatments have dose/intensity rather than binary on/off. Active research area with recent breakthroughs.
47+
Semiparametrically efficient versions of existing DiD/event-study estimators with 40%+ precision gains over current methods.
7048

71-
- Treatment effect on treated (ATT) parameters under generalized parallel trends
72-
- Dose-response curves and marginal effects
73-
- Handle settings where "dose" varies across units and time
74-
- Event studies with continuous treatments
75-
76-
**References**:
77-
- [Callaway, Goodman-Bacon & Sant'Anna (2024)](https://arxiv.org/abs/2107.02637). *NBER Working Paper*.
78-
- [de Chaisemartin, D'Haultfœuille & Vazquez-Bare (2024)](https://arxiv.org/abs/2402.05432). *AEA Papers and Proceedings*.
49+
**Reference**: [Chen, Sant'Anna & Xie (2025)](https://arxiv.org/abs/2506.17729). *Working Paper*.
7950

8051
### de Chaisemartin-D'Haultfœuille Estimator
8152

@@ -84,7 +55,6 @@ Handles treatment that switches on and off (reversible treatments), unlike most
8455
- Allows units to move into and out of treatment
8556
- Time-varying, heterogeneous treatment effects
8657
- Comparison with never-switchers or flexible control groups
87-
- Different assumptions than CS/SA—useful for different settings
8858

8959
**Reference**: [de Chaisemartin & D'Haultfœuille (2020, 2024)](https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3980758). *American Economic Review*.
9060

@@ -95,7 +65,6 @@ Implements local projections for dynamic treatment effects. Doesn't require spec
9565
- Flexible impulse response estimation
9666
- Robust to misspecification of dynamics
9767
- Natural handling of anticipation effects
98-
- Growing use in macroeconomics and policy evaluation
9968

10069
**Reference**: Dube, Girardi, Jordà, and Taylor (2023).
10170

@@ -105,30 +74,16 @@ For outcomes where linear models are inappropriate (binary, count, bounded).
10574

10675
- Logit/probit DiD for binary outcomes
10776
- Poisson DiD for count outcomes
108-
- Flexible strategies for staggered designs with nonlinear models
10977
- Proper handling of incidence rate ratios and odds ratios
11078

11179
**Reference**: [Wooldridge (2023)](https://academic.oup.com/ectj/article/26/3/C31/7250479). *The Econometrics Journal*.
11280

113-
### Doubly Robust DiD + Synthetic Control
114-
115-
Unified framework combining DiD and synthetic control with doubly robust identification—valid under *either* parallel trends or synthetic control assumptions.
116-
117-
- ATT identified under parallel trends OR group-level SC condition
118-
- Semiparametric estimation framework
119-
- Multiplier bootstrap for valid inference under either assumption
120-
- Strengthens credibility by avoiding the DiD vs. SC trade-off
121-
122-
**Reference**: [Sun, Xie & Zhang (2025)](https://arxiv.org/abs/2503.11375). *Working Paper*.
123-
12481
### Causal Duration Analysis with DiD
12582

12683
Extends DiD to duration/survival outcomes where standard methods fail (hazard rates, time-to-event).
12784

12885
- Duration analogue of parallel trends on hazard rates
12986
- Avoids distributional assumptions and hazard function specification
130-
- Visual and formal pre-trends assessment for duration data
131-
- Handles absorbing states approaching probability bounds
13287

13388
**Reference**: [Deaner & Ku (2025)](https://www.aeaweb.org/conference/2025/program/paper/k77Kh8iS). *AEA Conference Paper*.
13489

@@ -138,38 +93,56 @@ Extends DiD to duration/survival outcomes where standard methods fail (hazard ra
13893

13994
Frontier methods requiring more research investment.
14095

141-
### Matrix Completion Methods
96+
### DiD with Interference / Spillovers
14297

143-
Unified framework encompassing synthetic control and regression approaches. Moves seamlessly between cross-sectional and time-series patterns.
98+
Standard DiD assumes SUTVA; spatial/network spillovers violate this. Two-stage imputation approach estimates treatment AND spillover effects under staggered timing.
14499

145-
- Nuclear norm regularization for low-rank structure
146-
- Handles missing data patterns common in panel settings
147-
- Bridges synthetic control (few units, many periods) and regression (many units, few periods)
148-
- Confidence intervals via debiasing
100+
**Reference**: [Butts (2024)](https://arxiv.org/abs/2105.03737). *Working Paper*.
149101

150-
**Reference**: [Athey et al. (2021)](https://arxiv.org/abs/1710.10251). *Journal of the American Statistical Association*.
102+
### Quantile/Distributional DiD
103+
104+
Recover the full counterfactual distribution and quantile treatment effects (QTT), not just mean ATT. Goes beyond "what's the average effect" to "who gains, who loses."
105+
106+
- Changes-in-Changes (CiC) identification strategy
107+
- QTT(τ) at user-specified quantiles
108+
- Full counterfactual distribution function
109+
- Two-period foundation, then staggered extension
110+
111+
**Reference**: [Athey & Imbens (2006)](https://onlinelibrary.wiley.com/doi/10.1111/j.1468-0262.2006.00668.x). *Econometrica*.
112+
113+
### CATT Meta-Learner for Heterogeneous Effects
114+
115+
ML-powered conditional ATT — discover who benefits most from treatment using doubly robust meta-learner.
116+
117+
**Reference**: [Lan, Chang, Dillon & Syrgkanis (2025)](https://arxiv.org/abs/2502.04699). *Working Paper*.
151118

152119
### Causal Forests for DiD
153120

154121
Machine learning methods for discovering heterogeneous treatment effects in DiD settings.
155122

156123
- Estimate treatment effect heterogeneity across covariates
157124
- Data-driven subgroup discovery
158-
- Combine with DiD identification for observational data
159125
- Honest confidence intervals for discovered heterogeneity
160126

161127
**References**:
162128
- [Kattenberg, Scheer & Thiel (2023)](https://ideas.repec.org/p/cpb/discus/452.html). *CPB Discussion Paper*.
163129
- Athey & Wager (2019). *Annals of Statistics*.
164130

131+
### Matrix Completion Methods
132+
133+
Unified framework encompassing synthetic control and regression approaches.
134+
135+
- Nuclear norm regularization for low-rank structure
136+
- Bridges synthetic control (few units, many periods) and regression (many units, few periods)
137+
138+
**Reference**: [Athey et al. (2021)](https://arxiv.org/abs/1710.10251). *Journal of the American Statistical Association*.
139+
165140
### Double/Debiased ML for DiD
166141

167142
For high-dimensional settings with many potential confounders.
168143

169144
- ML for nuisance parameter estimation (propensity, outcome models)
170145
- Cross-fitting for valid inference
171-
- Handles many covariates without overfitting concerns
172-
- Doubly-robust estimation with ML flexibility
173146

174147
**Reference**: Chernozhukov et al. (2018). *The Econometrics Journal*.
175148

@@ -183,10 +156,6 @@ For high-dimensional settings with many potential confounders.
183156

184157
## Infrastructure Improvements
185158

186-
Ongoing maintenance and developer experience.
187-
188-
### Documentation
189-
190159
- Video tutorials and worked examples
191160

192161
---

0 commit comments

Comments
 (0)