Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

$\epsilon$-Softmax: Approximating One-Hot Vectors for Mitigating Label Noise

About

Noisy labels pose a common challenge for training accurate deep neural networks. To mitigate label noise, prior studies have proposed various robust loss functions to achieve noise tolerance in the presence of label noise, particularly symmetric losses. However, they usually suffer from the underfitting issue due to the overly strict symmetric condition. In this work, we propose a simple yet effective approach for relaxing the symmetric condition, namely $\epsilon$-softmax, which simply modifies the outputs of the softmax layer to approximate one-hot vectors with a controllable error $\epsilon$. Essentially, $\epsilon$-softmax not only acts as an alternative for the softmax layer, but also implicitly plays the crucial role in modifying the loss function. We prove theoretically that $\epsilon$-softmax can achieve noise-tolerant learning with controllable excess risk bound for almost any loss function. Recognizing that $\epsilon$-softmax-enhanced losses may slightly reduce fitting ability on clean datasets, we further incorporate them with one symmetric loss, thereby achieving a better trade-off between robustness and effective learning. Extensive experiments demonstrate the superiority of our method in mitigating synthetic and real-world label noise. The code is available at https://github.com/cswjl/eps-softmax.

Jialiang Wang, Xiong Zhou, Deming Zhai, Junjun Jiang, Xiangyang Ji, Xianming Liu• 2025

Related benchmarks

TaskDatasetResultRank
Image ClassificationImageNet (val)
Top-1 Acc67.2
1206
Image ClassificationClothing1M (test)
Accuracy69.85
546
Image ClassificationCIFAR-10N (Worst)
Accuracy95.12
78
Image ClassificationCIFAR-10N (Aggregate)
Accuracy95.95
74
Image ClassificationWebVision 1.0 (val)
Top-1 Acc71.32
59
Image ClassificationCIFAR-10N (Random 1)
Accuracy95.79
36
Image ClassificationCIFAR-10N (Random 2)
Accuracy95.91
34
Image ClassificationCIFAR-10N (Random 3)
Accuracy95.96
34
Image ClassificationCIFAR-100 Noisy
Accuracy71.97
19
Image ClassificationCIFAR-100 instance-dependent noise (IDN) (test)
Acc (η=0.2)67.44
18
Showing 10 of 11 rows

Other info

Code

Follow for update