Local Causal Discovery for Estimating Causal Effects
About
Even when the causal graph underlying our data is unknown, we can use observational data to narrow down the possible values that an average treatment effect (ATE) can take by (1) identifying the graph up to a Markov equivalence class; and (2) estimating that ATE for each graph in the class. While the PC algorithm can identify this class under strong faithfulness assumptions, it can be computationally prohibitive. Fortunately, only the local graph structure around the treatment is required to identify the set of possible ATE values, a fact exploited by local discovery algorithms to improve computational efficiency. In this paper, we introduce Local Discovery using Eager Collider Checks (LDECC), a new local causal discovery algorithm that leverages unshielded colliders to orient the treatment's parents differently from existing methods. We show that there exist graphs where LDECC exponentially outperforms existing local discovery algorithms and vice versa. Moreover, we show that LDECC and existing algorithms rely on different faithfulness assumptions, leveraging this insight to weaken the assumptions for identifying the set of possible ATE values.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Causal Structure Learning | Synthetic nD=10000, d=2, dmax=10, 100 nodes | CI Test Count7.95 | 13 | |
| Causal Structure Learning | Synthetic nD=10000, d=2, dmax=10, 200 nodes | Number of CI Tests23 | 13 | |
| Causal Structure Learning | Synthetic nD=10000, d=2, dmax=10, 400 nodes | CI Test Count100.5 | 13 | |
| Causal Structure Learning | Synthetic nD=10000, d=2, dmax=10, 600 nodes | Number of CI tests179.1 | 11 | |
| Causal Structure Learning | Synthetic nD=10000, d=2, dmax=10, 800 nodes | Number of CI tests400.8 | 11 | |
| Causal Discovery | Binary data 15 nodes, nD=1000, d=2, dmax=10 | Number of CI Tests296 | 7 | |
| Local Causal Discovery | Linear Gaussian 100 nodes | CI Test Count (x10^3)4.93e+3 | 7 | |
| Local Causal Discovery | Linear Gaussian 200 nodes | CI Test Count (x10^3)16.29 | 7 | |
| Local Causal Discovery | Linear Gaussian 400 nodes | Number of CI tests (x10^3)61.89 | 7 | |
| Causal Discovery | Binary data 20 nodes, nD=1000, d=2, dmax=10 | Number of CI Tests384.7 | 7 |