PACOH: Bayes-Optimal Meta-Learning with PAC-Guarantees
About
Meta-learning can successfully acquire useful inductive biases from data. Yet, its generalization properties to unseen learning tasks are poorly understood. Particularly if the number of meta-training tasks is small, this raises concerns about overfitting. We provide a theoretical analysis using the PAC-Bayesian framework and derive novel generalization bounds for meta-learning. Using these bounds, we develop a class of PAC-optimal meta-learning algorithms with performance guarantees and a principled meta-level regularization. Unlike previous PAC-Bayesian meta-learners, our method results in a standard stochastic optimization problem which can be solved efficiently and scales well. When instantiating our PAC-optimal hyper-posterior (PACOH) with Gaussian processes and Bayesian Neural Networks as base learners, the resulting methods yield state-of-the-art performance, both in terms of predictive accuracy and the quality of uncertainty estimates. Thanks to their principled treatment of uncertainty, our meta-learners can also be successfully employed for sequential decision problems.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | miniImageNet 5-way 1-shot (meta-test) | Accuracy44.6 | 41 | |
| Natural Language Inference | 12 NLI tasks (CoNLL-2003, MIT-Restaurant, Amazon Reviews, Crowdflower) (test) | Avg Test Accuracy58.22 | 15 | |
| meta learning | Rand. Branin (meta-test) | Calibration Error0.105 | 6 | |
| meta learning | XGBoost (meta-test) | Calibration Error0.084 | 6 | |
| Meta-Regression | Rand. Branin (meta-test) | Test Log-Likelihood-2.507 | 6 | |
| Meta-Regression | GLMNET (meta-test) | Average Test Log-Likelihood1.369 | 6 | |
| meta learning | Camelb. Sin-Noise (meta-test) | Calibration Error0.054 | 6 | |
| meta learning | GLMNET (meta-test) | Calibration Error0.175 | 6 | |
| meta learning | RPart (meta-test) | Calibration Error0.151 | 6 | |
| Meta-Regression | Camelback Sin-Noise (meta-test) | Avg Test Log-Likelihood-0.716 | 6 |