Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Post-hoc Uncertainty Learning using a Dirichlet Meta-Model

About

It is known that neural networks have the problem of being over-confident when directly using the output label distribution to generate uncertainty measures. Existing methods mainly resolve this issue by retraining the entire model to impose the uncertainty quantification capability so that the learned model can achieve desired performance in accuracy and uncertainty prediction simultaneously. However, training the model from scratch is computationally expensive and may not be feasible in many situations. In this work, we consider a more practical post-hoc uncertainty learning setting, where a well-trained base model is given, and we focus on the uncertainty quantification task at the second stage of training. We propose a novel Bayesian meta-model to augment pre-trained models with better uncertainty quantification abilities, which is effective and computationally efficient. Our proposed method requires no additional training data and is flexible enough to quantify different uncertainties and easily adapt to different application settings, including out-of-domain data detection, misclassification detection, and trustworthy transfer learning. We demonstrate our proposed meta-model approach's flexibility and superior empirical performance on these applications over multiple representative image classification benchmarks.

Maohao Shen, Yuheng Bu, Prasanna Sattigeri, Soumya Ghosh, Subhro Das, Gregory Wornell• 2022

Related benchmarks

TaskDatasetResultRank
Out-of-Distribution DetectionCIFAR-10
AUROC90.65
121
Out-of-Distribution DetectionImageNet--
108
Out-of-Distribution DetectionCIFAR-10 vs CIFAR-100
AUROC82.43
70
Multiple-choice Question AnsweringOBQA
Accuracy93
69
Multiple-choice Question AnsweringRACE
Accuracy89.42
54
Out-of-Distribution DetectionOBQA to MMLU
AUROC78.98
41
Out-of-Distribution DetectionRACE to MMLU
AUROC79.72
41
Out-of-Distribution DetectionCIFAR10 vs. SVHN
AUROC85.68
31
Out-of-Distribution DetectionImageNet-R
ROC AUC0.4825
28
Uncertainty EstimationRACE Llama-3.1-8B and Gemma-2-9B backbones (test)
AUROC88.96
24
Showing 10 of 26 rows

Other info

Follow for update