
Label Distributionally Robust Losses for Multi-class Classification: Consistency, Robustness and Adaptivity

About

We study a family of loss functions named label-distributionally robust (LDR) losses for multi-class classification. They are formulated from a distributionally robust optimization (DRO) perspective, in which the uncertainty in the given label information is modeled and captured by taking the worst case over distributional weights. This perspective has several benefits: (i) it provides a unified framework that explains the classical cross-entropy (CE) loss, the SVM loss, and their variants; (ii) it includes a special family corresponding to the temperature-scaled CE loss, which is widely adopted but poorly understood; (iii) it allows us to adapt to the degree of label uncertainty at the instance level. Our contributions are: (1) we study both consistency and robustness by establishing top-$k$ ($\forall k\geq 1$) consistency of LDR losses for multi-class classification, together with a negative result that a top-$1$ consistent and symmetric robust loss cannot simultaneously achieve top-$k$ consistency for all $k\geq 2$; (2) we propose a new adaptive LDR loss that automatically adapts an individualized temperature parameter to the noise degree of each instance's class label; (3) we demonstrate stable and competitive performance of the proposed adaptive LDR loss against 13 loss functions on 7 benchmark datasets under 6 noisy-label settings and 1 clean setting, and on one real-world noisy dataset. The code is open-sourced at https://github.com/Optimization-AI/ICML2023_LDR.
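To make the link between the worst-case view and the temperature-scaled CE loss concrete, here is a minimal NumPy sketch. The abstract does not state the loss formula, so the form below is an assumption: the worst case over label weights is regularized by a KL divergence to the uniform distribution with weight `lam`, which gives a scaled log-mean-exp in closed form. The function names, the `margin` parameter, and the example scores are hypothetical and are not taken from the authors' repository.

```python
import numpy as np


def ldr_kl_loss(scores, label, lam, margin=0.0):
    """Assumed KL-regularized LDR loss for one example.

    Closed form of max_p sum_k p_k * (s_k - s_y + c_k) - lam * KL(p || uniform),
    where c_k = margin for k != y and c_y = 0 (an assumption of this sketch).
    """
    scores = np.asarray(scores, dtype=float)
    num_classes = scores.shape[0]
    c = np.full(num_classes, margin)
    c[label] = 0.0
    z = (scores - scores[label] + c) / lam
    m = z.max()  # subtract the max for a numerically stable log-sum-exp
    return lam * (m + np.log(np.exp(z - m).sum()) - np.log(num_classes))


def temperature_ce(scores, label, lam):
    """Standard cross-entropy on scores divided by the temperature lam."""
    z = np.asarray(scores, dtype=float) / lam
    m = z.max()
    return m + np.log(np.exp(z - m).sum()) - z[label]


scores, label = [2.0, 0.5, -1.0], 1
diffs = np.asarray(scores) - scores[label]

# With zero margin the assumed loss is temperature-scaled CE up to scale and shift.
lam = 0.7
shifted_ce = lam * (temperature_ce(scores, label, lam) - np.log(len(scores)))
print(np.isclose(ldr_kl_loss(scores, label, lam), shifted_ce))

# Small lam approaches the largest score gap (an SVM-style max loss);
# large lam approaches the average score gap (a symmetric loss).
print(ldr_kl_loss(scores, label, 1e-3), diffs.max())
print(ldr_kl_loss(scores, label, 1e4), diffs.mean())
```

Under this assumed form, `lam` plays the role of the temperature, which is why an instance-level `lam` can be read as an instance-level trust in the given label: a small value concentrates the weight on the hardest competing class, while a large value spreads it evenly.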

Dixian Zhu, Yiming Ying, Tianbao Yang • 2021

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| Image Classification | ImageNet (val) | Top-1 Acc: 65.24 | 1206 |
| Image Classification | Clothing1M (test) | Accuracy: 66.88 | 546 |
| Image Classification | WebVision 1.0 (val) | Top-1 Acc: 69.64 | 59 |
| Image Classification | CIFAR-10 instance-dependent noise (IDN) (test) | Accuracy (η=0.2): 88.99 | 18 |
| Image Classification | CIFAR-100 instance-dependent noise (IDN) (test) | Acc (η=0.2): 59.19 | 18 |