Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Revisiting Essential and Nonessential Settings of Evidential Deep Learning

About

Evidential Deep Learning (EDL) is an emerging method for uncertainty estimation that provides reliable predictive uncertainty in a single forward pass, attracting significant attention. Grounded in subjective logic, EDL derives Dirichlet concentration parameters from neural networks to construct a Dirichlet probability density function (PDF), modeling the distribution of class probabilities. Despite its success, EDL incorporates several nonessential settings: In model construction, (1) a commonly ignored prior weight parameter is fixed to the number of classes, while its value actually impacts the balance between the proportion of evidence and its magnitude in deriving predictive scores. In model optimization, (2) the empirical risk features a variance-minimizing optimization term that biases the PDF towards a Dirac delta function, potentially exacerbating overconfidence. (3) Additionally, the structural risk typically includes a KL-divergence-minimizing regularization, whose optimization direction extends beyond the intended purpose and contradicts common sense, diminishing the information carried by the evidence magnitude. Therefore, we propose Re-EDL, a simplified yet more effective variant of EDL, by relaxing the nonessential settings and retaining the essential one, namely, the adoption of projected probability from subjective logic. Specifically, Re-EDL treats the prior weight as an adjustable hyperparameter rather than a fixed scalar, and directly optimizes the expectation of the Dirichlet PDF provided by deprecating both the variance-minimizing optimization term and the divergence regularization term. Extensive experiments and state-of-the-art performance validate the effectiveness of our method. The source code is available at https://github.com/MengyuanChen21/Re-EDL.

Mengyuan Chen, Junyu Gao, Changsheng Xu• 2024

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-10 (test)
Accuracy90.13
882
Image ClassificationCIFAR-10--
875
Image ClassificationCIFAR-100
Accuracy64.53
357
OOD DetectionCIFAR-10 (IND) SVHN (OOD)
AUROC0.9222
138
Image ClassificationCIFAR-10-LT
Top-1 Accuracy66.27
127
OOD DetectionCIFAR-10 (test)
AUROC85.46
115
OOD DetectionCIFAR-100 IND SVHN OOD
AUROC (%)77.89
81
Model CalibrationCIFAR10 (test)--
68
Out-of-Distribution DetectionCIFAR-10 ID CIFAR-100 OOD--
66
OOD DetectionCIFAR100 ID TImageNet OOD
AUROC0.7467
38
Showing 10 of 42 rows

Other info

Follow for update