Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Intra Order-preserving Functions for Calibration of Multi-Class Neural Networks

About

Predicting calibrated confidence scores for multi-class deep networks is important for avoiding rare but costly mistakes. A common approach is to learn a post-hoc calibration function that transforms the output of the original network into calibrated confidence scores while maintaining the network's accuracy. However, previous post-hoc calibration techniques work only with simple calibration functions, potentially lacking sufficient representation to calibrate the complex function landscape of deep networks. In this work, we aim to learn general post-hoc calibration functions that can preserve the top-k predictions of any deep network. We call this family of functions intra order-preserving functions. We propose a new neural network architecture that represents a class of intra order-preserving functions by combining common neural network components. Additionally, we introduce order-invariant and diagonal sub-families, which can act as regularization for better generalization when the training data size is small. We show the effectiveness of the proposed method across a wide range of datasets and classifiers. Our method outperforms state-of-the-art post-hoc calibration methods, namely temperature scaling and Dirichlet calibration, in several evaluation metrics for the task.

Amir Rahimi, Amirreza Shaban, Ching-An Cheng, Richard Hartley, Byron Boots• 2020

Related benchmarks

TaskDatasetResultRank
Confidence calibrationDermatology
Confidence Calibration Error0.063
66
Confidence calibrationGlass
Calibration Error0.12
44
Confidence calibrationvehicle
Calibration Error0.058
44
Confidence calibrationCAR
Calibration Error6.3
44
Confidence calibrationCiteseer
ECE6.08
36
Confidence calibrationPubmed
ECE1.23
36
Confidence calibrationCora
ECE4.24
36
ClassificationGlass
Accuracy69.8
32
Classificationvehicle
Accuracy82.9
30
Confidence calibrationCoraFull
ECE8.33
28
Showing 10 of 21 rows

Other info

Follow for update