Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Improving Generative Methods for Causal Evaluation via Simulation-Based Inference

About

Generating synthetic datasets that accurately reflect real-world observational data is critical for evaluating causal estimators, but it remains a challenging task. Existing generative methods offer a solution by producing synthetic datasets anchored in the observed data (source data) while allowing variation in key parameters such as the treatment effect and amount of confounding bias. However, it is often unclear which generative methods to use and which values of parameters to choose when generating synthetic datasets. Moreover, existing methods typically require users to provide fixed point estimates of such parameters. This denies users the ability to express uncertainty over both generative methods and parameter values and removes the potential for posterior inference, potentially leading to unreliable estimator comparisons. We introduce simulation-based inference for causal evaluation (SBICE), a framework that treats the generative method and its corresponding generative parameters as uncertain and infers their posterior distribution given a source dataset. Leveraging techniques in simulation-based inference, SBICE identifies suitable generative methods and infers distributions over its parameter configurations to produce synthetic datasets closely aligned with the source data distribution. Empirical results demonstrate that SBICE improves the reliability of estimator evaluations by generating realistic datasets whose causal estimates closely match the estimates of the source data, making it a robust and uncertainty-aware approach to selecting causal estimators.

Pracheta Amaranath, Vinitra Muralikrishnan, Amit Sharma, David Jensen• 2025

Related benchmarks

TaskDatasetResultRank
Causal EvaluationLinearParam DGP1 Simulator Sim1 (synthetic)
SWD0.09
14
Causal EvaluationFrugal DGP4 Simulator: FrugalParam Sim4(u) (synthetic)
SWD2.26
14
Causal EvaluationFrugal DGP4 Simulator FrugalFlows Sim4(u) (synthetic)
SWD0.3
14
Causal EvaluationFrugal DGP3
SWD0.2
14
Causal EvaluationFrugal DGP5
SWD0.32
14
Causal EvaluationLalonde Obs
SWD0.28
14
Causal EstimationTwins Posterior
Mean BSE0.001
10
Causal EstimationLalonde Prior
Mean BSE3.52
7
Causal EstimationLalonde Posterior
Mean BSE0.016
7
Causal EstimationPostgres Prior
Mean BSE7.827
7
Showing 10 of 13 rows

Other info

Follow for update