Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Learn to Accumulate Evidence from All Training Samples: Theory and Practice

About

Evidential deep learning, built upon belief theory and subjective logic, offers a principled and computationally efficient way to turn a deterministic neural network uncertainty-aware. The resultant evidential models can quantify fine-grained uncertainty using the learned evidence. To ensure theoretically sound evidential models, the evidence needs to be non-negative, which requires special activation functions for model training and inference. This constraint often leads to inferior predictive performance compared to standard softmax models, making it challenging to extend them to many large-scale datasets. To unveil the real cause of this undesired behavior, we theoretically investigate evidential models and identify a fundamental limitation that explains the inferior performance: existing evidential activation functions create zero evidence regions, which prevent the model to learn from training samples falling into such regions. A deeper analysis of evidential activation functions based on our theoretical underpinning inspires the design of a novel regularizer that effectively alleviates this fundamental limitation. Extensive experiments over many challenging real-world datasets and settings confirm our theoretical findings and demonstrate the effectiveness of our proposed approach.

Deep Pandey, Qi Yu• 2023

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-10 (test)
Accuracy89.43
882
Model CalibrationCIFAR10 (test)
ECE4.97
68
Out-of-Distribution DetectionCIFAR-10 ID CIFAR-100 OOD--
66
Misclassification DetectionCIFAR-10 (test)
AUPR Succ98.82
15
OOD DetectionMNIST → KMNIST--
13
Out-of-Distribution DetectionSVHN OOD CIFAR-10 ID
AUPR82.85
12
OOD DetectionCIFAR-10 vs SVHN
Maximum Probability (MP)90.1
7
OOD DetectionMNIST vs FMNIST
MP99.34
7
ID ClassificationMNIST
Accuracy99.38
7
ID ClassificationCIFAR-10
Accuracy89.8
7
Showing 10 of 11 rows

Other info

Follow for update