Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

On Learning Necessary and Sufficient Causal Graphs

About

The causal revolution has stimulated interest in understanding complex relationships in various fields. Most of the existing methods aim to discover causal relationships among all variables within a complex large-scale graph. However, in practice, only a small subset of variables in the graph are relevant to the outcomes of interest. Consequently, causal estimation with the full causal graph -- particularly given limited data -- could lead to numerous falsely discovered, spurious variables that exhibit high correlation with, but exert no causal impact on, the target outcome. In this paper, we propose learning a class of necessary and sufficient causal graphs (NSCG) that exclusively comprises causally relevant variables for an outcome of interest, which we term causal features. The key idea is to employ probabilities of causation to systematically evaluate the importance of features in the causal graph, allowing us to identify a subgraph relevant to the outcome of interest. To learn NSCG from data, we develop a necessary and sufficient causal structural learning (NSCSL) algorithm, by establishing theoretical properties and relationships between probabilities of causation and natural causal effects of features. Across empirical studies of simulated and real data, we demonstrate that NSCSL outperforms existing algorithms and can reveal crucial yeast genes for target heritable traits of interest.

Hengrui Cai, Yixin Wang, Michael Jordan, Rui Song• 2023

Related benchmarks

TaskDatasetResultRank
Causal DiscoverySachs real data d=11
SHD8
19
Causal DiscoveryNonlinear structural equation model S4
FDR0.03
9
Causal DiscoveryScale-free (SF) model Scenario S5 p=50 n=1000 Degree=5 (test)
FDR2
9
Causal Structural LearningErdős-Rényi (ER) Model n=100 S4 (small)
FDR11
9
Causal Structural LearningErdős-Rényi (ER) Model n=1000 S4 (large)
FDR0.00e+0
9
Causal Structural LearningErdős-Rényi (ER) Model n=1000 Scenario S5 (small)
FDR2
9
Causal Structural LearningErdős-Rényi (ER) Model n=3000 Scenario S5 (large)
FDR0.01
9
Causal DiscoveryYeast gene data YER124C
Total Edges11
8
Causal DiscoveryScenario S1 (n=30) synthetic (test)
FDR8
5
Causal DiscoveryScenario S1 (n=100) synthetic (test)
FDR2
5
Showing 10 of 14 rows

Other info

Follow for update