Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Typicalness-Aware Learning for Failure Detection

About

Deep neural networks (DNNs) often suffer from the overconfidence issue, where incorrect predictions are made with high confidence scores, hindering the applications in critical systems. In this paper, we propose a novel approach called Typicalness-Aware Learning (TAL) to address this issue and improve failure detection performance. We observe that, with the cross-entropy loss, model predictions are optimized to align with the corresponding labels via increasing logit magnitude or refining logit direction. However, regarding atypical samples, the image content and their labels may exhibit disparities. This discrepancy can lead to overfitting on atypical samples, ultimately resulting in the overconfidence issue that we aim to address. To tackle the problem, we have devised a metric that quantifies the typicalness of each sample, enabling the dynamic adjustment of the logit magnitude during the training process. By allowing atypical samples to be adequately fitted while preserving reliable logit direction, the problem of overconfidence can be mitigated. TAL has been extensively evaluated on benchmark datasets, and the results demonstrate its superiority over existing failure detection methods. Specifically, TAL achieves a more than 5% improvement on CIFAR100 in terms of the Area Under the Risk-Coverage Curve (AURC) compared to the state-of-the-art. Code is available at https://github.com/liuyijungoon/TAL.

Yijun Liu, Jiequan Cui, Zhuotao Tian, Senqiao Yang, Qingdong He, Xiaoling Wang, Jingyong Su• 2024

Related benchmarks

TaskDatasetResultRank
Failure DetectionCIFAR100 vs. SVHN
AURC Score347.7
39
Failure DetectionCIFAR100 (test)
AURC90.6
39
Out-of-Distribution DetectionCIFAR100
AURC259.6
39
Out-of-Distribution DetectionImageNet vs. Textures
AURC290.5
11
Failure DetectionImageNet Old setting
AURC64.66
11
Failure DetectionImageNet vs. Textures New setting
AURC338.4
11
Failure DetectionImageNet vs. WILDS New setting
AURC288.7
10
Out-of-Distribution DetectionImageNet WILDS
AURC232.1
10
Failure DetectionCIFAR100 Old Setting
AURC27.15
5
Failure DetectionCIFAR100 New FD Setting
AURC262.6
5
Showing 10 of 11 rows

Other info

Code

Follow for update