Distributionally Robust Skeleton Learning of Discrete Bayesian Networks
About
We consider the problem of learning the exact skeleton of general discrete Bayesian networks from potentially corrupted data. Building on distributionally robust optimization and a regression approach, we propose to optimize the most adverse risk over a family of distributions within bounded Wasserstein distance or KL divergence to the empirical distribution. The worst-case risk accounts for the effect of outliers. The proposed approach applies for general categorical random variables without assuming faithfulness, an ordinal relationship or a specific form of conditional distribution. We present efficient algorithms and show the proposed methods are closely related to the standard regularized regression approach. Under mild assumptions, we derive non-asymptotic guarantees for successful structure learning with logarithmic sample complexities for bounded-degree graphs. Numerical study on synthetic and real datasets validates the effectiveness of our method. Code is available at https://github.com/DanielLeee/drslbn.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Skeleton Estimation | asia | F1 Score78 | 15 | |
| Bayesian Network Structure Recovery | asia 8 nodes (test) | F1 Score78 | 11 | |
| Bayesian Network Structure Learning | backache real-world 32 nodes | BIC Score-1.73e+3 | 6 | |
| Bayesian Network Structure Learning | connect-4 6000 samples 43 nodes | BIC-3.90e+4 | 6 | |
| Bayesian Network Structure Learning | voting real-world 17 nodes | BIC-2.45e+3 | 6 | |
| Skeleton Estimation | Cancer | F1 Score1 | 4 | |
| Skeleton Estimation | Earthquake | F1 Score93.33 | 3 | |
| Structure learning | backache | BIC-1.73e+3 | 2 | |
| Structure learning | voting | BIC-2.45e+3 | 2 |