Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Local Causal Discovery for Statistically Efficient Causal Inference

About

Causal discovery methods can identify valid adjustment sets for causal effect estimation for a pair of target variables, even when the underlying causal graph is unknown. Global causal discovery methods focus on learning the whole causal graph and therefore enable the recovery of optimal adjustment sets, i.e., sets with the lowest asymptotic variance, but they quickly become computationally prohibitive as the number of variables grows. Local causal discovery methods offer a more scalable alternative by focusing on the local neighborhood of the target variables, but are restricted to statistically suboptimal adjustment sets. In this work, we propose Local Optimal Adjustments Discovery (LOAD), a sound and complete causal discovery approach that combines the computational efficiency of local methods with the statistical optimality of global methods. First, LOAD identifies the causal relation between the targets and tests if the causal effect is identifiable by using only local information. If it is identifiable, it finds the possible descendants of the treatment and infers the optimal adjustment set as the parents of the outcome in a modified forbidden projection. Otherwise, it returns the locally valid parent adjustment sets. In our experiments on synthetic and realistic data LOAD outperforms global methods in scalability, while providing more accurate effect estimation than local methods.

M\'aty\'as Schubert, Tom Claassen, Sara Magliacane• 2025

Related benchmarks

TaskDatasetResultRank
Causal Structure LearningSynthetic nD=10000, d=2, dmax=10, 100 nodes
CI Test Count0.86
13
Causal Structure LearningSynthetic nD=10000, d=2, dmax=10, 200 nodes
Number of CI Tests1.49
13
Causal Structure LearningSynthetic nD=10000, d=2, dmax=10, 400 nodes
CI Test Count2.94
13
Causal Structure LearningSynthetic nD=10000, d=2, dmax=10, 600 nodes
Number of CI tests4.36
11
Causal Structure LearningSynthetic nD=10000, d=2, dmax=10, 800 nodes
Number of CI tests5.19e+3
11
Causal DiscoveryBinary data 20 nodes, nD=1000, d=2, dmax=10
Number of CI Tests239.6
7
Local Causal DiscoveryLinear Gaussian 100 nodes
CI Test Count (x10^3)950
7
Local Causal DiscoveryLinear Gaussian 200 nodes
CI Test Count (x10^3)1.79
7
Local Causal DiscoveryLinear Gaussian 400 nodes
Number of CI tests (x10^3)3.77
7
Causal DiscoveryBinary data 10 nodes, nD=1000, d=2, dmax=10
Number of CI tests127.8
7
Showing 10 of 18 rows

Other info

Follow for update