Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Balancing Two Classifiers via A Simplex ETF Structure for Model Calibration

About

In recent years, deep neural networks (DNNs) have demonstrated state-of-the-art performance across various domains. However, despite their success, they often face calibration issues, particularly in safety-critical applications such as autonomous driving and healthcare, where unreliable predictions can have serious consequences. Recent research has started to improve model calibration from the view of the classifier. However, the exploration of designing the classifier to solve the model calibration problem is insufficient. Let alone most of the existing methods ignore the calibration errors arising from underconfidence. In this work, we propose a novel method by balancing learnable and ETF classifiers to solve the overconfidence or underconfidence problem for model Calibration named BalCAL. By introducing a confidence-tunable module and a dynamic adjustment method, we ensure better alignment between model confidence and its true accuracy. Extensive experimental validation shows that ours significantly improves model calibration performance while maintaining high predictive accuracy, outperforming existing techniques. This provides a novel solution to the calibration challenges commonly encountered in deep learning.

Jiani Ni, He Zhao, Jintong Gao, Dandan Guo, Hongyuan Zha• 2025

Related benchmarks

TaskDatasetResultRank
OOD DetectionCIFAR-100 standard (test)
AUROC (%)81.26
94
Image Classification CalibrationCIFAR100
Classwise ECE0.0096
62
OOD DetectionSVHN (test)
AUROC0.9498
61
Model CalibrationCIFAR-100
ECE4.21
53
Model CalibrationSVHN
ECE0.24
40
Model CalibrationCIFAR-10
ECE0.76
40
OOD DetectionCIFAR-10 (test)
AUROC89.89
40
Model CalibrationTiny-ImageNet
Accuracy66.9
18
Model CalibrationCIFAR-10, CIFAR-100, and SVHN
Average ECE1.77
13
Image Classification CalibrationImageNet
Accuracy82.32
6
Showing 10 of 10 rows

Other info

Follow for update