Credal Wrapper of Model Averaging for Uncertainty Estimation in Classification
About
This paper presents an innovative approach, called credal wrapper, to formulating a credal set representation of model averaging for Bayesian neural networks (BNNs) and deep ensembles (DEs), capable of improving uncertainty estimation in classification tasks. Given a finite collection of single predictive distributions derived from BNNs or DEs, the proposed credal wrapper approach extracts an upper and a lower probability bound per class, acknowledging the epistemic uncertainty due to the availability of a limited amount of distributions. Such probability intervals over classes can be mapped on a convex set of probabilities (a credal set) from which, in turn, a unique prediction can be obtained using a transformation called intersection probability transformation. In this article, we conduct extensive experiments on several out-of-distribution (OOD) detection benchmarks, encompassing various dataset pairs (CIFAR10/100 vs SVHN/Tiny-ImageNet, CIFAR10 vs CIFAR10-C, CIFAR100 vs CIFAR100-C and ImageNet vs ImageNet-O) and using different network architectures (such as VGG16, ResNet-18/50, EfficientNet B2, and ViT Base). Compared to the BNN and DE baselines, the proposed credal wrapper method exhibits superior performance in uncertainty estimation and achieves a lower expected calibration error on corrupted data.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| OOD Detection | CIFAR-10 (IND) SVHN (OOD) | AUROC0.961 | 91 | |
| OOD Detection | CIFAR-10 (ID) vs Places 365 (OOD) | AUROC91.6 | 77 | |
| OOD Detection | CIFAR10 ID FMNIST OOD | AUROC0.952 | 54 | |
| OOD Detection | CIFAR10 (ID) vs CIFAR100 (OOD) (test) | AUROC91.6 | 36 | |
| OOD Detection | ImageNet (OOD) with CIFAR10 (ID) | AUROC89.9 | 36 | |
| OOD Detection | CIFAR10 (ID) vs CIFAR100 (OOD) | AUROC91.6 | 8 |