DiffPO: A causal diffusion model for learning distributions of potential outcomes
About
Predicting potential outcomes of interventions from observational data is crucial for decision-making in medicine, but the task is challenging due to the fundamental problem of causal inference. Existing methods are largely limited to point estimates of potential outcomes with no uncertain quantification; thus, the full information about the distributions of potential outcomes is typically ignored. In this paper, we propose a novel causal diffusion model called DiffPO, which is carefully designed for reliable inferences in medicine by learning the distribution of potential outcomes. In our DiffPO, we leverage a tailored conditional denoising diffusion model to learn complex distributions, where we address the selection bias through a novel orthogonal diffusion loss. Another strength of our DiffPO method is that it is highly flexible (e.g., it can also be used to estimate different causal quantities such as CATE). Across a wide range of experiments, we show that our method achieves state-of-the-art performance.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| CATE estimation | ACIC 77 datasets 2016 (in-sample) | Percentage Best26.73 | 9 | |
| CATE estimation | ACIC 77 datasets 2016 (out-of-sample) | % Best25.82 | 9 | |
| CATE estimation | ACIC 2018 (in-sample) | Percent Best31.34 | 9 | |
| CATE estimation | ACIC 24 datasets 2018 (out-of-sample) | Best Performance Ratio36.86 | 9 | |
| Point Estimation of Potential Outcomes | Synthetic dataset In-sample (train) | RMSE (a=1)0.143 | 6 | |
| Point Estimation of Potential Outcomes | Synthetic dataset Out-of-sample (test) | RMSE (Treated Group)0.156 | 6 | |
| Uncertainty Estimation | Synthetic a=0 10-fold (test) | 95% PI Coverage98.1 | 6 | |
| Uncertainty Estimation | Synthetic a=1 10-fold (test) | 95% PI0.908 | 6 |