Skip to content

Commit e53f6e5

Browse files
TomeHirataclaude
andcommitted
Move paper to paper/ subdirectory and add tutorial figures
- Move paper.md and paper.bib into paper/ to avoid top-level clutter - Add hillstorm_dte.png and oregon_ldte_costs_comparison.png from tutorials - Add figure captions showing simple vs ML-adjusted comparison for both the Hillstrom email campaign and Oregon Health Insurance Experiment - Cross-reference figures in Research Impact Statement - Add dataset citations (Hillstrom 2008, Finkelstein et al. 2012) - Add GitHub Action (draft-pdf.yml) to auto-build JOSS PDF on push using paper-path: paper/paper.md Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
1 parent 38ea69b commit e53f6e5

5 files changed

Lines changed: 42 additions & 1 deletion

File tree

.github/workflows/draft-pdf.yml

Lines changed: 19 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,19 @@
1+
on: [push]
2+
3+
jobs:
4+
paper:
5+
runs-on: ubuntu-latest
6+
name: Paper Draft
7+
steps:
8+
- name: Checkout
9+
uses: actions/checkout@v4
10+
- name: Build draft PDF
11+
uses: openjournals/openjournals-draft-action@master
12+
with:
13+
journal: joss
14+
paper-path: paper/paper.md
15+
- name: Upload
16+
uses: actions/upload-artifact@v4
17+
with:
18+
name: paper
19+
path: paper/paper.pdf

paper/hillstorm_dte.png

180 KB
Loading
67.6 KB
Loading

paper.bib renamed to paper/paper.bib

Lines changed: 18 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -116,6 +116,24 @@ @ARTICLE{2020SciPy-NMeth
116116
doi = {10.1038/s41592-019-0686-2}
117117
}
118118

119+
@misc{hillstrom2008,
120+
title={The MineThatData E-Mail Analytics And Data Mining Challenge},
121+
author={Hillstrom, Kevin},
122+
year={2008},
123+
url={https://blog.minethatdata.com/2008/03/minethatdata-e-mail-analytics-and-data.html}
124+
}
125+
126+
@article{finkelstein2012,
127+
title={The Oregon Health Insurance Experiment: Evidence from the First Year},
128+
author={Finkelstein, Amy and Taubman, Sarah and Wright, Bill and Bernstein, Mira and Gruber, Jonathan and Newhouse, Joseph P. and Allen, Heidi and Baicker, Katherine and {Oregon Health Study Group}},
129+
journal={The Quarterly Journal of Economics},
130+
volume={127},
131+
number={3},
132+
pages={1057--1106},
133+
year={2012},
134+
doi={10.1093/qje/qjs020}
135+
}
136+
119137
@article{kobrosly2020causalcurve,
120138
title={causal-curve: A Python Causal Inference Package to Estimate Causal Dose-Response Curves},
121139
author={Kobrosly, Roni W.},

paper.md renamed to paper/paper.md

Lines changed: 5 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -60,9 +60,13 @@ In the R ecosystem, packages like `qte` provide quantile treatment effect estima
6060

6161
All estimators implement a consistent API with three primary methods: `predict_dte()` for distributional treatment effects, `predict_pte()` for probability treatment effects over intervals, and `predict_qte()` for quantile treatment effects. The adjusted estimators use K-fold cross-fitting to prevent overfitting and support both single-task and multi-task learning modes [@hirata2025efficientscalableestimationdistributional] for computational efficiency. Bootstrap methods provide confidence intervals with multiple variance estimation approaches.
6262

63+
![Distributional treatment effects for the Hillstrom email marketing dataset [@hillstrom2008], comparing Women's vs Men's email campaigns. The simple estimator (left, purple) and ML-adjusted estimator (right, green) show that adjustment substantially tightens confidence bands, demonstrating the variance reduction benefit of regression adjustment.\label{fig:hillstrom}](hillstrom_dte.png)
64+
65+
![Local distributional treatment effects for emergency department costs in the Oregon Health Insurance Experiment [@finkelstein2012], estimated using `SimpleLocalDistributionEstimator` (left) and `AdjustedLocalDistributionEstimator` (right). Health insurance coverage shifts the distribution of ED costs, with ML adjustment again yielding narrower confidence intervals.\label{fig:oregon}](oregon_ldte_costs_comparison.png)
66+
6367
# Research Impact Statement
6468

65-
The methods implemented in `dte_adj` have been published at top machine learning venues: ICML 2024 [@byambadalai2024estimatingdistributionaltreatmenteffects] and ICML 2025 [@byambadalai2025efficientestimationdistributionaltreatment]. The package has been used internally at CyberAgent, Inc. for analyzing A/B tests where distributional impacts are critical, such as evaluating interventions on user engagement metrics where tail behavior matters more than averages. The documentation includes tutorials demonstrating applications to the Hillstrom email marketing dataset and the Oregon Health Insurance Experiment, facilitating adoption by researchers in economics, marketing, and healthcare.
69+
The methods implemented in `dte_adj` have been published at top machine learning venues: ICML 2024 [@byambadalai2024estimatingdistributionaltreatmenteffects] and ICML 2025 [@byambadalai2025efficientestimationdistributionaltreatment]. The package has been used internally at CyberAgent, Inc. for analyzing A/B tests where distributional impacts are critical, such as evaluating interventions on user engagement metrics where tail behavior matters more than averages. The documentation includes tutorials demonstrating applications to the Hillstrom email marketing dataset (\autoref{fig:hillstrom}) and the Oregon Health Insurance Experiment (\autoref{fig:oregon}), facilitating adoption by researchers in economics, marketing, and healthcare.
6670

6771
# AI Usage Disclosure
6872

0 commit comments

Comments
 (0)