Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Trustworthy Long-Tailed Classification

About

Classification on long-tailed distributed data is a challenging problem, which suffers from serious class-imbalance and accordingly unpromising performance especially on tail classes. Recently, the ensembling based methods achieve the state-of-the-art performance and show great potential. However, there are two limitations for current methods. First, their predictions are not trustworthy for failure-sensitive applications. This is especially harmful for the tail classes where the wrong predictions is basically frequent. Second, they assign unified numbers of experts to all samples, which is redundant for easy samples with excessive computational cost. To address these issues, we propose a Trustworthy Long-tailed Classification (TLC) method to jointly conduct classification and uncertainty estimation to identify hard samples in a multi-expert framework. Our TLC obtains the evidence-based uncertainty (EvU) and evidence for each expert, and then combines these uncertainties and evidences under the Dempster-Shafer Evidence Theory (DST). Moreover, we propose a dynamic expert engagement to reduce the number of engaged experts for easy samples and achieve efficiency while maintaining promising performances. Finally, we conduct comprehensive experiments on the tasks of classification, tail detection, OOD detection and failure prediction. The experimental results show that the proposed TLC outperforms existing methods and is trustworthy with reliable uncertainty.

Bolian Li, Zongbo Han, Haining Li, Huazhu Fu, Changqing Zhang• 2021

Related benchmarks

TaskDatasetResultRank
Long-Tailed Image ClassificationImageNet-LT (test)
Top-1 Acc (Overall)55.1
220
Long-Tailed Image ClassificationCIFAR-100-LT Imbalance Ratio 100
Top-1 Acc49.8
47
Image ClassificationCIFAR-10-LT Imbalance Ratio 100 (test)
Top-1 Accuracy80.3
32
Image ClassificationImageNet-LT IR=256 (test)
Accuracy54.6
32
Image ClassificationCIFAR-100-LT IR=100 (test)
Accuracy49
25
Remote Sensing ClassificationDOTA (test)
Top-1 Accuracy (Head)88.25
11
Remote Sensing ClassificationFGSC-23 (test)
Top-1 Acc (Head)29
11
Failure PredictionCIFAR-100-LT (test)
AUC (All)77.9
9
Failure PredictionImageNet-LT (test)
AUC (All)67.2
9
Long-tailed classificationCIFAR-10 long-tail (test)
Accuracy (All)0.804
9
Showing 10 of 15 rows

Other info

Code

Follow for update