Typicalness-Aware Learning for Failure Detection

About

Deep neural networks (DNNs) often suffer from overconfidence: incorrect predictions are made with high confidence scores, which hinders their use in critical systems. In this paper, we propose Typicalness-Aware Learning (TAL), a novel approach that addresses this issue and improves failure detection performance. We observe that, with the cross-entropy loss, model predictions are optimized to align with the corresponding labels by increasing logit magnitude or refining logit direction. For atypical samples, however, the image content and the label may not agree. This discrepancy can lead to overfitting on atypical samples, which ultimately results in overconfidence. To tackle the problem, we devise a metric that quantifies the typicalness of each sample and use it to dynamically adjust the logit magnitude during training. By allowing atypical samples to be adequately fitted while preserving a reliable logit direction, overconfidence can be mitigated. TAL has been extensively evaluated on benchmark datasets, and the results demonstrate its superiority over existing failure detection methods. Specifically, TAL achieves an improvement of more than 5% on CIFAR100 in terms of the Area Under the Risk-Coverage Curve (AURC) compared to the state of the art. Code is available at https://github.com/liuyijungoon/TAL.
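The distinction between logit magnitude and logit direction can be illustrated with a minimal sketch. This is not the TAL implementation: the helper names `decompose` and `rescale` and the example values are hypothetical, and the sketch only shows that rescaling a logit vector changes the softmax confidence while leaving the predicted class unchanged.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def decompose(logits):
    """Split a logit vector into its L2 magnitude and unit direction."""
    magnitude = math.sqrt(sum(z * z for z in logits))
    if magnitude == 0.0:
        raise ValueError("zero logit vector has no direction")
    direction = [z / magnitude for z in logits]
    return magnitude, direction

def rescale(logits, new_magnitude):
    """Keep the direction of `logits` but impose a different magnitude."""
    _, direction = decompose(logits)
    return [new_magnitude * d for d in direction]

# Hypothetical logits for a 3-class problem.
logits = [2.0, 1.0, 0.0]

# Same direction, two different magnitudes.
low = softmax(rescale(logits, 1.0))
high = softmax(rescale(logits, 10.0))

# The predicted class is fixed by the direction ...
pred_low = low.index(max(low))
pred_high = high.index(max(high))

# ... while the confidence (max probability) grows with the magnitude.
conf_low = max(low)
conf_high = max(high)
```

Under this view, a training scheme that controls the magnitude per sample, as the abstract describes, can limit how confident the model becomes on atypical samples without changing which class the direction points to.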

Yijun Liu, Jiequan Cui, Zhuotao Tian, Senqiao Yang, Qingdong He, Xiaoling Wang, Jingyong Su • 2024

Related benchmarks

Task	Dataset	Result
Failure Detection	CIFAR100 vs. SVHN	AURC Score 347.7	39
Failure Detection	CIFAR100 (test)	AURC 90.6	39
Out-of-Distribution Detection	CIFAR100	AURC 259.6	39
Out-of-Distribution Detection	ImageNet vs. Textures	AURC 290.5	11
Failure Detection	ImageNet Old setting	AURC 64.66	11
Failure Detection	ImageNet vs. Textures New setting	AURC 338.4	11
Failure Detection	ImageNet vs. WILDS New setting	AURC 288.7	10
Out-of-Distribution Detection	ImageNet WILDS	AURC 232.1	10
Failure Detection	CIFAR100 Old Setting	AURC 27.15	5
Failure Detection	CIFAR100 New FD Setting	AURC 262.6	5
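For readers unfamiliar with the metric, the following is a minimal sketch of how an AURC value can be computed from per-sample confidences and correctness flags. The function name `aurc` and the toy data are hypothetical, and the result is on the raw 0 to 1 scale; reported values such as those in the table may use a different scaling (for example, multiplied by 1000), which is an assumption and is not stated on this page.

```python
def aurc(confidences, correct):
    """Area under the risk-coverage curve (lower is better).

    Samples are ranked by confidence, highest first. At each coverage
    level k/n, the risk is the error rate among the k most confident
    samples; AURC is the mean of these risks over all k.
    Ties in confidence are broken by input order.
    """
    if len(confidences) != len(correct) or not confidences:
        raise ValueError("inputs must be non-empty and of equal length")
    order = sorted(range(len(confidences)),
                   key=lambda i: confidences[i], reverse=True)
    errors = 0
    total_risk = 0.0
    for k, i in enumerate(order, start=1):
        if not correct[i]:
            errors += 1
        total_risk += errors / k  # selective risk at coverage k/n
    return total_risk / len(order)

# Toy example: four predictions, already sorted by confidence.
confs = [0.9, 0.8, 0.7, 0.6]

# The single error carries the lowest confidence (good failure detection).
good = aurc(confs, [True, True, True, False])

# The single error carries the highest confidence (overconfident failure).
bad = aurc(confs, [False, True, True, True])
```

Both toy models have the same accuracy, but the one whose error receives the lowest confidence obtains the lower AURC, which is why the metric rewards confidence estimates that separate correct from incorrect predictions.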

