Distributionally Robust Skeleton Learning of Discrete Bayesian Networks

About

We consider the problem of learning the exact skeleton of general discrete Bayesian networks from potentially corrupted data. Building on distributionally robust optimization and a regression approach, we propose to optimize the most adverse risk over a family of distributions within bounded Wasserstein distance or KL divergence to the empirical distribution. The worst-case risk accounts for the effect of outliers. The proposed approach applies for general categorical random variables without assuming faithfulness, an ordinal relationship or a specific form of conditional distribution. We present efficient algorithms and show the proposed methods are closely related to the standard regularized regression approach. Under mild assumptions, we derive non-asymptotic guarantees for successful structure learning with logarithmic sample complexities for bounded-degree graphs. Numerical study on synthetic and real datasets validates the effectiveness of our method. Code is available at https://github.com/DanielLeee/drslbn.

Yeshu Li, Brian D. Ziebart• 2023

Related benchmarks

Task	Dataset	Result
Skeleton Estimation	asia	F1 Score78	15
Bayesian Network Structure Recovery	asia 8 nodes (test)	F1 Score78	11
Bayesian Network Structure Learning	backache real-world 32 nodes	BIC Score-1.73e+3	6
Bayesian Network Structure Learning	connect-4 6000 samples 43 nodes	BIC-3.90e+4	6
Bayesian Network Structure Learning	voting real-world 17 nodes	BIC-2.45e+3	6
Skeleton Estimation	Cancer	F1 Score1	4
Skeleton Estimation	Earthquake	F1 Score93.33	3
Structure learning	backache	BIC-1.73e+3	2
Structure learning	voting	BIC-2.45e+3	2

Showing 9 of 9 rows

Other info

Code

Follow for update

@wizwand_team Discord