Credal Wrapper of Model Averaging for Uncertainty Estimation in Classification

About

This paper presents an innovative approach, called credal wrapper, to formulating a credal set representation of model averaging for Bayesian neural networks (BNNs) and deep ensembles (DEs), capable of improving uncertainty estimation in classification tasks. Given a finite collection of single predictive distributions derived from BNNs or DEs, the proposed credal wrapper approach extracts an upper and a lower probability bound per class, acknowledging the epistemic uncertainty due to the availability of a limited amount of distributions. Such probability intervals over classes can be mapped on a convex set of probabilities (a credal set) from which, in turn, a unique prediction can be obtained using a transformation called intersection probability transformation. In this article, we conduct extensive experiments on several out-of-distribution (OOD) detection benchmarks, encompassing various dataset pairs (CIFAR10/100 vs SVHN/Tiny-ImageNet, CIFAR10 vs CIFAR10-C, CIFAR100 vs CIFAR100-C and ImageNet vs ImageNet-O) and using different network architectures (such as VGG16, ResNet-18/50, EfficientNet B2, and ViT Base). Compared to the BNN and DE baselines, the proposed credal wrapper method exhibits superior performance in uncertainty estimation and achieves a lower expected calibration error on corrupted data.

Kaizheng Wang, Fabio Cuzzolin, Keivan Shariatmadar, David Moens, Hans Hallez• 2024

Related benchmarks

Task	Dataset	Result
OOD Detection	CIFAR-10 (IND) SVHN (OOD)	AUROC0.961	152
OOD Detection	CIFAR-10 (ID) vs Places 365 (OOD)	AUROC91.6	131
Out-of-Distribution Detection	ImageNet	AUROC89	113
OOD Detection	CIFAR10 ID FMNIST OOD	AUROC0.952	54
OOD Detection	CIFAR10 (ID) vs CIFAR100 (OOD) (test)	AUROC91.6	36
OOD Detection	ImageNet (OOD) with CIFAR10 (ID)	AUROC89.9	36
Out-of-Distribution Detection	FMNIST	OOD Score95.2	26
Out-of-Distribution Detection	Places365	AUROC91.6	21
OOD Detection	CIFAR10 (ID) vs CIFAR100 (OOD)	AUROC91.6	15

Showing 9 of 9 rows

Other info

Follow for update

@wizwand_team Discord