Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Graphical Normalizing Flows

About

Normalizing flows model complex probability distributions by combining a base distribution with a series of bijective neural networks. State-of-the-art architectures rely on coupling and autoregressive transformations to lift up invertible functions from scalars to vectors. In this work, we revisit these transformations as probabilistic graphical models, showing they reduce to Bayesian networks with a pre-defined topology and a learnable density at each node. From this new perspective, we propose the graphical normalizing flow, a new invertible transformation with either a prescribed or a learnable graphical structure. This model provides a promising way to inject domain knowledge into normalizing flows while preserving both the interpretability of Bayesian networks and the representation capacity of normalizing flows. We show that graphical conditioners discover relevant graph structure when we cannot hypothesize it. In addition, we analyze the effect of $\ell_1$-penalization on the recovered structure and on the quality of the resulting density estimation. Finally, we show that graphical conditioners lead to competitive white box density estimators. Our implementation is available at https://github.com/AWehenkel/DAG-NF.

Antoine Wehenkel, Gilles Louppe• 2020

Related benchmarks

TaskDatasetResultRank
DAG Structure Recoverynon-linear-1 5000 samples
SHD3.4
48
Causal Discoverynon-linear-2 d=10, 5000 samples (test)
SHD3.3
12
Causal Discoverynon-linear-2 (d=20, 5000 samples) (test)
SHD9.7
12
Causal Structure LearningLinear Synthetic Data d=10 5000 samples
SHD2.4
12
Causal Structure LearningLinear Synthetic Data d=20, 5000 samples
SHD5.2
12
Causal Discoverynon-linear-2 d=50, 5000 samples (test)
Structural Hamming Distance24.3
12
Causal Discoverynon-linear-2 d=100, 5000 samples (test)
Structural Hamming Distance (SHD)68.1
12
Causal Structure LearningLinear Synthetic Data d=50, 5000 samples
SHD23.1
12
Causal Structure LearningLinear Synthetic Data d=100, 5000 samples
SHD64.2
12
DAG Structure RecoverySachs
SHD15
9
Showing 10 of 18 rows

Other info

Follow for update