Predictive variational inference: Learn the predictively optimal posterior distribution
About
Vanilla variational inference finds an optimal approximation to the Bayesian posterior distribution, but even the exact Bayesian posterior is often not meaningful under model misspecification. We propose predictive variational inference (PVI): a general inference framework that seeks and samples from an optimal posterior density such that the resulting posterior predictive distribution is as close to the true data generating process as possible, while this closeness is measured by multiple scoring rules. By optimizing the objective, the predictive variational inference is generally not the same as, or even attempting to approximate, the Bayesian posterior, even asymptotically. Rather, we interpret it as implicit hierarchical expansion. Further, the learned posterior uncertainty detects heterogeneity of parameters among the population, enabling automatic model diagnosis. This framework applies to both likelihood-exact and likelihood-free models. We demonstrate its application in real data examples.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Posterior Predictive Inference | posteriordb earnings (test) | Log Score-283.4 | 8 | |
| Posterior Predictive Inference | posteriordb kidscore (test) | Log Score-369.9 | 8 | |
| Posterior Predictive Inference | posteriordb nes (test) | Log Loss-226.6 | 8 | |
| Posterior Predictive Inference | posteriordb radon (test) | Log Score-3.37e+3 | 8 | |
| Posterior Predictive Inference | posteriordb election (test) | Log Score-1.60e+3 | 6 | |
| Posterior Predictive Inference | posteriordb wells (test) | Log Score-415.4 | 6 | |
| Posterior Predictive Inference | posteriordb glmm (test) | Log Score-1.15e+3 | 4 |