Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning Sample Difficulty from Pre-trained Models for Reliable Prediction

About

Large-scale pre-trained models have achieved remarkable success in many applications, but how to leverage them to improve the prediction reliability of downstream models is undesirably under-explored. Moreover, modern neural networks have been found to be poorly calibrated and make overconfident predictions regardless of inherent sample difficulty and data uncertainty. To address this issue, we propose to utilize large-scale pre-trained models to guide downstream model training with sample difficulty-aware entropy regularization. Pre-trained models that have been exposed to large-scale datasets and do not overfit the downstream training classes enable us to measure each training sample's difficulty via feature-space Gaussian modeling and relative Mahalanobis distance computation. Importantly, by adaptively penalizing overconfident prediction based on the sample difficulty, we simultaneously improve accuracy and uncertainty calibration across challenging benchmarks (e.g., +0.55% ACC and -3.7% ECE on ImageNet1k using ResNet34), consistently surpassing competitive baselines for reliable prediction. The improved uncertainty estimate further improves selective classification (abstaining from erroneous predictions) and out-of-distribution detection.

Peng Cui, Dan Zhang, Zhijie Deng, Yinpeng Dong, Jun Zhu• 2023

Related benchmarks

TaskDatasetResultRank
Out-of-Distribution DetectionCIFAR-10 vs CIFAR-100 (test)--
93
Image ClassificationCIFAR-10 (test)
Accuracy95.67
59
Misclassification DetectionImageNet-1k (val)
FPR@95% (MSP)45.69
7
Out-of-Distribution DetectionImageNet-1k vs iNaturalist (val test)
FPR@95% (MaxLogit)32.17
7
Image ClassificationImageNet1K (val)--
6
Image ClassificationImageNet-1k (val)
Accuracy (Top-1)74.11
5
Showing 6 of 6 rows

Other info

Follow for update