On Inductive Biases for Heterogeneous Treatment Effect Estimation
About
We investigate how to exploit structural similarities of an individual's potential outcomes (POs) under different treatments to obtain better estimates of conditional average treatment effects in finite samples. Especially when it is unknown whether a treatment has an effect at all, it is natural to hypothesize that the POs are similar - yet, some existing strategies for treatment effect estimation employ regularization schemes that implicitly encourage heterogeneity even when it does not exist and fail to fully make use of shared structure. In this paper, we investigate and compare three end-to-end learning strategies to overcome this problem - based on regularization, reparametrization and a flexible multi-task architecture - each encoding inductive bias favoring shared behavior across POs. To build understanding of their relative strengths, we implement all strategies using neural networks and conduct a wide range of semi-synthetic experiments. We observe that all three approaches can lead to substantial improvements upon numerous baselines and gain insight into performance differences across various experimental settings.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| pancytopenia onset prediction | Claims | Rate10 | 60 | |
| Treatment Effect Estimation | RORCO semi-synthetic | MSE1.92e-4 | 22 | |
| Treatment Effect Estimation | ACIC semi-synthetic 2016 (test) | Mean Error6.65e-4 | 22 | |
| Treatment Effect Estimation | ACIC semi-synthetic 2017 | Mean TEE Error1.87e-4 | 22 | |
| Treatment Effect Estimation | RORCO Real | Mean Error-0.0593 | 22 | |
| Treatment Effect Estimation | NEWS semi-synthetic (test) | MSE2.26e-5 | 22 | |
| Treatment Effect Estimation | NEWS semi-synthetic | Mean Error2.26e-5 | 22 | |
| Treatment Effect Estimation | JOBS semi-synthetic (test) | MSE0.0021 | 22 | |
| Causal Inference | IHDP | MSE0.438 | 20 | |
| CATE estimation | IHDP Setup C (in-sample) | NRMSE0.268 | 17 |