Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Credal Wrapper of Model Averaging for Uncertainty Estimation in Classification

About

This paper presents an innovative approach, called credal wrapper, to formulating a credal set representation of model averaging for Bayesian neural networks (BNNs) and deep ensembles (DEs), capable of improving uncertainty estimation in classification tasks. Given a finite collection of single predictive distributions derived from BNNs or DEs, the proposed credal wrapper approach extracts an upper and a lower probability bound per class, acknowledging the epistemic uncertainty due to the availability of a limited amount of distributions. Such probability intervals over classes can be mapped on a convex set of probabilities (a credal set) from which, in turn, a unique prediction can be obtained using a transformation called intersection probability transformation. In this article, we conduct extensive experiments on several out-of-distribution (OOD) detection benchmarks, encompassing various dataset pairs (CIFAR10/100 vs SVHN/Tiny-ImageNet, CIFAR10 vs CIFAR10-C, CIFAR100 vs CIFAR100-C and ImageNet vs ImageNet-O) and using different network architectures (such as VGG16, ResNet-18/50, EfficientNet B2, and ViT Base). Compared to the BNN and DE baselines, the proposed credal wrapper method exhibits superior performance in uncertainty estimation and achieves a lower expected calibration error on corrupted data.

Kaizheng Wang, Fabio Cuzzolin, Keivan Shariatmadar, David Moens, Hans Hallez• 2024

Related benchmarks

TaskDatasetResultRank
OOD DetectionCIFAR-10 (IND) SVHN (OOD)
AUROC0.961
91
OOD DetectionCIFAR-10 (ID) vs Places 365 (OOD)
AUROC91.6
77
OOD DetectionCIFAR10 ID FMNIST OOD
AUROC0.952
54
OOD DetectionCIFAR10 (ID) vs CIFAR100 (OOD) (test)
AUROC91.6
36
OOD DetectionImageNet (OOD) with CIFAR10 (ID)
AUROC89.9
36
OOD DetectionCIFAR10 (ID) vs CIFAR100 (OOD)
AUROC91.6
8
Showing 6 of 6 rows

Other info

Follow for update