Balancing Two Classifiers via A Simplex ETF Structure for Model Calibration

About

In recent years, deep neural networks (DNNs) have demonstrated state-of-the-art performance across various domains. Despite this success, they are often poorly calibrated, which is a particular concern in safety-critical applications such as autonomous driving and healthcare, where unreliable predictions can have serious consequences. Recent research has started to improve model calibration from the perspective of the classifier, but classifier design as a route to better calibration remains underexplored. Moreover, most existing methods ignore the calibration errors that arise from underconfidence. In this work, we propose BalCAL, a novel method for model calibration that balances a learnable classifier and an ETF classifier to address both overconfidence and underconfidence. By introducing a confidence-tunable module and a dynamic adjustment method, we ensure better alignment between model confidence and true accuracy. Extensive experiments show that our method significantly improves calibration performance while maintaining high predictive accuracy, outperforming existing techniques. This provides a novel solution to the calibration challenges commonly encountered in deep learning.
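
To make the "ETF classifier" in the abstract concrete, the following is a minimal sketch of a fixed simplex equiangular tight frame (ETF) classifier in NumPy. The construction is the standard one (unit-norm class vectors with pairwise cosine -1/(K-1)). The function names, the `alpha` weight, and the convex blend of logits are hypothetical illustrations, not the authors' confidence-tunable module or dynamic adjustment method, which this listing does not describe.

```python
import numpy as np


def simplex_etf(num_classes: int, feat_dim: int, seed: int = 0) -> np.ndarray:
    """Return a (feat_dim, num_classes) matrix whose columns form a simplex ETF.

    Assumption: feat_dim >= num_classes, so a random orthonormal basis exists.
    """
    if num_classes < 2:
        raise ValueError("need at least two classes")
    if feat_dim < num_classes:
        raise ValueError("this construction assumes feat_dim >= num_classes")
    k = num_classes
    rng = np.random.default_rng(seed)
    # Reduced QR of a random matrix gives orthonormal columns U (feat_dim x k).
    u, _ = np.linalg.qr(rng.standard_normal((feat_dim, k)))
    # Centering matrix I - (1/k) * 11^T removes the mean of the class vectors.
    center = np.eye(k) - np.ones((k, k)) / k
    return np.sqrt(k / (k - 1)) * (u @ center)


def blended_logits(features, learnable_w, etf_w, alpha):
    """Hypothetical blend: alpha weights the learnable head, 1 - alpha the ETF head."""
    return alpha * (features @ learnable_w) + (1.0 - alpha) * (features @ etf_w)
```

In this sketch the ETF weights stay fixed during training while `learnable_w` is trained as usual; how the two heads are actually balanced in BalCAL is left to the paper.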

Jiani Ni, He Zhao, Jintong Gao, Dandan Guo, Hongyuan Zha • 2025

Related benchmarks

Task	Dataset	Result
Model Calibration	CIFAR-100	ECE 4.21	150
OOD Detection	CIFAR-10 (test)	AUROC 89.89	115
Image Classification Calibration	CIFAR100	Classwise ECE 0.0096	99
OOD Detection	CIFAR-100 standard (test)	AUROC (%) 81.26	94
Model Calibration	Tiny-ImageNet	Expected Calibration Error 1.35	92
OOD Detection	SVHN (test)	AUROC 0.9498	84
Model Calibration	CIFAR-10	ECE 0.76	68
Model Calibration	SVHN	ECE 0.24	40
Image Classification Calibration	ImageNet	Expected Calibration Error 1.48	23
Image Classification	CIFAR-100 (test)	Accuracy (Test) 81.34	16

Showing 10 of 11 rows
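
Several rows above report Expected Calibration Error (ECE). As a reference point, here is a minimal sketch of the common equal-width-bin ECE estimator. The function name and the default of 15 bins are assumptions for illustration; the listing does not state the binning scheme used for these results or whether the listed values are fractions or percentages.

```python
import numpy as np


def expected_calibration_error(confidences, correct, n_bins=15):
    """Equal-width-bin ECE as a fraction in [0, 1].

    confidences: max predicted probability per sample, in [0, 1].
    correct: 1 if the prediction was right, else 0.
    """
    confidences = np.asarray(confidences, dtype=float)
    correct = np.asarray(correct, dtype=float)
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    # Bin b covers (edges[b], edges[b + 1]]; a confidence of exactly 0 is
    # clipped into the first bin.
    idx = np.clip(np.digitize(confidences, edges, right=True) - 1, 0, n_bins - 1)
    ece = 0.0
    for b in range(n_bins):
        mask = idx == b
        if mask.any():
            # |accuracy - mean confidence| in the bin, weighted by bin share.
            gap = abs(correct[mask].mean() - confidences[mask].mean())
            ece += mask.mean() * gap
    return float(ece)
```

For example, four predictions all made at confidence 0.9, of which three are correct, land in a single bin with accuracy 0.75, so the estimator returns a gap of 0.15.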
