Improved Variational Bayesian Phylogenetic Inference with Normalizing Flows
About
Variational Bayesian phylogenetic inference (VBPI) provides a promising general variational framework for efficient estimation of phylogenetic posteriors. However, the current diagonal Lognormal branch length approximation would significantly restrict the quality of the approximating distributions. In this paper, we propose a new type of VBPI, VBPI-NF, as a first step to empower phylogenetic posterior estimation with deep learning techniques. By handling the non-Euclidean branch length space of phylogenetic models with carefully designed permutation equivariant transformations, VBPI-NF uses normalizing flows to provide a rich family of flexible branch length distributions that generalize across different tree topologies. We show that VBPI-NF significantly improves upon the vanilla VBPI on a benchmark of challenging real data Bayesian phylogenetic inference problems. Further investigation also reveals that the structured parameterization in those permutation equivariant transformations can provide additional amortization benefit.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Marginal log-likelihood estimation | DS1 27 taxa, 1949 sites 1.0 (test) | Mean Log-Likelihood-7.11e+3 | 6 | |
| Marginal log-likelihood estimation | DS2 29 taxa, 2520 sites 1.0 (test) | Mean Log-Likelihood-2.64e+4 | 6 | |
| Marginal log-likelihood estimation | DS3 (36 taxa, 1812 sites) 1.0 (test) | Mean Log-Likelihood-3.37e+4 | 6 | |
| Marginal log-likelihood estimation | DS4 (41 taxa, 1137 sites) 1.0 (test) | Mean Log-Likelihood-1.33e+4 | 6 | |
| Marginal log-likelihood estimation | DS5 50 taxa 378 sites 1.0 (test) | Mean Log-Likelihood-8.21e+3 | 6 | |
| Marginal log-likelihood estimation | DS6 50 taxa 1133 sites 1.0 (test) | Mean Log-Likelihood-6.72e+3 | 6 | |
| Marginal log-likelihood estimation | DS7 64 taxa, 1008 sites 1.0 (test) | Mean Log-Likelihood-8.65e+3 | 5 |