Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Energy-Based Open-World Uncertainty Modeling for Confidence Calibration

About

Confidence calibration is of great importance to the reliability of decisions made by machine learning systems. However, discriminative classifiers based on deep neural networks are often criticized for producing overconfident predictions that fail to reflect the true correctness likelihood of classification accuracy. We argue that such an inability to model uncertainty is mainly caused by the closed-world nature in softmax: a model trained by the cross-entropy loss will be forced to classify input into one of $K$ pre-defined categories with high probability. To address this problem, we for the first time propose a novel $K$+1-way softmax formulation, which incorporates the modeling of open-world uncertainty as the extra dimension. To unify the learning of the original $K$-way classification task and the extra dimension that models uncertainty, we propose a novel energy-based objective function, and moreover, theoretically prove that optimizing such an objective essentially forces the extra dimension to capture the marginal data distribution. Extensive experiments show that our approach, Energy-based Open-World Softmax (EOW-Softmax), is superior to existing state-of-the-art methods in improving confidence calibration.

Yezhen Wang, Bo Li, Tong Che, Kaiyang Zhou, Ziwei Liu, Dongsheng Li• 2021

Related benchmarks

TaskDatasetResultRank
Unknown sample identificationSynthetic
AUROC0.862
29
Unknown sample identificationSynthetic-to-Real
AUROC74.8
28
Unknown sample identificationReal-to-Real
AUROC82.6
24
Time Series OOD GeneralizationUniMiB-SHAR
OOD Result 1 Score57.07
18
Time Series OOD GeneralizationUCIHAR, UniMiB-SHAR, EMG, Opportunity Aggregated
Average Performance73.42
18
Time Series OOD GeneralizationEMG
Accuracy 165.03
18
Time Series OOD GeneralizationUCIHAR
OOD Performance Metric 194.15
18
Time Series OOD GeneralizationOpportunity
S182.02
18
Human Activity RecognitionUCIHAR
ECE0.09
5
Human Activity RecognitionOpportunity
ECE8
5
Showing 10 of 12 rows

Other info

Follow for update