Local Discovery by Partitioning: Polynomial-Time Causal Discovery Around Exposure-Outcome Pairs

About

Causal discovery is crucial for causal inference in observational studies, as it can enable the identification of valid adjustment sets (VAS) for unbiased effect estimation. However, global causal discovery is notoriously hard in the nonparametric setting, with exponential time and sample complexity in the worst case. To address this, we propose local discovery by partitioning (LDP): a local causal discovery method that is tailored for downstream inference tasks without requiring parametric and pretreatment assumptions. LDP is a constraint-based procedure that returns a VAS for an exposure-outcome pair under latent confounding, given sufficient conditions. The total number of independence tests performed is worst-case quadratic with respect to the cardinality of the variable set. Asymptotic theoretical guarantees are numerically validated on synthetic graphs. Adjustment sets from LDP yield less biased and more precise average treatment effect estimates than baseline discovery algorithms, with LDP outperforming on confounder recall, runtime, and test count for VAS discovery. Notably, LDP ran at least 1300x faster than baselines on a benchmark.
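To make concrete why a valid adjustment set matters for effect estimation, the following minimal sketch simulates a single observed confounder and compares a naive regression slope with one adjusted for that confounder. It is not the LDP procedure. The linear data-generating model, its coefficients, and all function names are assumptions chosen for illustration only.

```python
import random


def slope(x, y):
    """Ordinary least-squares slope of y on x (with intercept)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    return sxy / sxx


def residuals(x, y):
    """Residuals of y after regressing it on x (with intercept)."""
    n = len(x)
    b = slope(x, y)
    a = sum(y) / n - b * sum(x) / n
    return [yi - (a + b * xi) for xi, yi in zip(x, y)]


def simulate(n=20000, true_effect=2.0, seed=0):
    """Hypothetical linear model: Z confounds exposure X and outcome Y."""
    rng = random.Random(seed)
    z = [rng.gauss(0, 1) for _ in range(n)]        # confounder
    x = [1.5 * zi + rng.gauss(0, 1) for zi in z]   # exposure depends on Z
    y = [true_effect * xi + 3.0 * zi + rng.gauss(0, 1)
         for xi, zi in zip(x, z)]                  # outcome depends on X and Z
    return z, x, y


TRUE_EFFECT = 2.0
z, x, y = simulate(true_effect=TRUE_EFFECT)

# Naive estimate: regress Y on X alone, ignoring the confounder.
naive = slope(x, y)

# Adjusted estimate: partial Z out of both X and Y, then regress the
# residuals (Frisch-Waugh-Lovell). This equals the coefficient on X in
# a regression of Y on X and Z, i.e. adjusting for the set {Z}.
adjusted = slope(residuals(z, x), residuals(z, y))

print(f"true effect:       {TRUE_EFFECT:.3f}")
print(f"naive estimate:    {naive:.3f}")
print(f"adjusted estimate: {adjusted:.3f}")
```

In this toy model the set {Z} is a valid adjustment set by construction, so the adjusted slope should land close to the true effect while the naive slope absorbs the confounding path through Z. In practice the adjustment set is not known in advance, which is the gap that a discovery method such as LDP aims to fill.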

Jacqueline Maasch, Weishen Pan, Shantanu Gupta, Volodymyr Kuleshov, Kyra Gan, Fei Wang • 2023

Related benchmarks

Task | Dataset | Result | Rank
--- | --- | --- | ---
Causal Structure Learning | Synthetic nD=10000, d=2, dmax=10, 100 nodes | CI Test Count: 0.88 | 13
Causal Structure Learning | Synthetic nD=10000, d=2, dmax=10, 200 nodes | Number of CI Tests: 1.51 | 13
Causal Structure Learning | Synthetic nD=10000, d=2, dmax=10, 400 nodes | CI Test Count: 2.96 | 13
Causal Structure Learning | Synthetic nD=10000, d=2, dmax=10, 600 nodes | Number of CI tests: 4.25 | 11
Causal Structure Learning | Synthetic nD=10000, d=2, dmax=10, 800 nodes | Number of CI tests: 5.11e+3 | 11
Local Causal Discovery | Linear Gaussian 200 nodes | CI Test Count (x10^3): 1.75 | 7
Local Causal Discovery | Linear Gaussian 400 nodes | Number of CI tests (x10^3): 3.56 | 7
Causal Discovery | Binary data 15 nodes, nD=1000, d=2, dmax=10 | Number of CI Tests: 232.3 | 7
Local Causal Discovery | Linear Gaussian 100 nodes | CI Test Count (x10^3): 960 | 7
Causal Discovery | Binary data 20 nodes, nD=1000, d=2, dmax=10 | Number of CI Tests: 258.6 | 7
Showing 10 of 18 rows
