
Nested Collaborative Learning for Long-Tailed Visual Recognition

About

Networks trained on a long-tailed dataset vary remarkably even under identical training settings, which reveals the great uncertainty in long-tailed learning. To alleviate this uncertainty, we propose Nested Collaborative Learning (NCL), which tackles the problem by collaboratively learning multiple experts together. NCL consists of two core components, Nested Individual Learning (NIL) and Nested Balanced Online Distillation (NBOD), which focus on the individual supervised learning of each expert and on knowledge transfer among multiple experts, respectively. To learn representations more thoroughly, both NIL and NBOD are formulated in a nested way: learning is conducted not only on all categories from a full perspective but also on some hard categories from a partial perspective. For the partial perspective, we select the negative categories with high predicted scores as the hard categories using the proposed Hard Category Mining (HCM). In NCL, learning from the two perspectives is nested, highly related, and complementary, helping the network capture not only global and robust features but also a meticulous distinguishing ability. Moreover, self-supervision is further used for feature enhancement. Extensive experiments demonstrate the superiority of our method, which outperforms the state of the art with either a single model or an ensemble.
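The idea of learning from a full and a partial perspective can be illustrated with a small sketch. It is only an illustration under stated assumptions, not the authors' implementation: the function names (`hard_category_mining`, `nested_individual_loss`), the parameter `num_hard`, the plain softmax cross-entropy, and the equal weighting of the two loss terms are all hypothetical choices made here, and the paper's actual formulation may differ.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def hard_category_mining(logits, target, num_hard):
    """Return the target index followed by the `num_hard` negative
    categories with the highest predicted scores (hypothetical helper)."""
    negatives = [i for i in range(len(logits)) if i != target]
    negatives.sort(key=lambda i: logits[i], reverse=True)
    return [target] + negatives[:num_hard]


def nested_individual_loss(logits, target, num_hard):
    """Cross-entropy over all categories (full perspective) plus
    cross-entropy restricted to the target and its hard negatives
    (partial perspective). Equal weighting is an assumption."""
    # Full perspective: every category takes part in the softmax.
    loss_full = -math.log(softmax(logits)[target])

    # Partial perspective: only the target and the mined hard categories.
    selected = hard_category_mining(logits, target, num_hard)
    partial_probs = softmax([logits[i] for i in selected])
    loss_partial = -math.log(partial_probs[0])  # target sits at position 0

    return loss_full + loss_partial


if __name__ == "__main__":
    example_logits = [2.0, 0.5, 1.5, -1.0, 3.0]
    print(hard_category_mining(example_logits, target=0, num_hard=2))
    print(nested_individual_loss(example_logits, target=0, num_hard=2))
```

In this sketch the partial term concentrates the loss on the negatives the expert currently confuses with the target, while the full term still covers every category.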

Jun Li, Zichang Tan, Jun Wan, Zhen Lei, Guodong Guo • 2022

Related benchmarks

Task | Dataset | Metric | Result | Rank
Long-Tailed Image Classification | ImageNet-LT (test) | Top-1 Acc (Overall) | 60.5 | 220
Image Classification | ImageNet-LT (test) | Top-1 Acc (All) | 57.4 | 159
Image Classification | Places-LT (test) | -- | -- | 128
Image Classification | CIFAR-100-LT Imbalance Ratio 100 | Top-1 Acc | 0.542 | 88
Image Classification | CIFAR-100-LT IF 100 (test) | Top-1 Acc | 54.2 | 77
Long-tailed recognition | Places-LT (test) | Accuracy (Overall) | 41.8 | 71
Image Classification | CIFAR-100-LT (Imbalance Ratio 50) | Top-1 Accuracy | 58.2 | 61
Long-Tailed Image Classification | Places-LT (test) | Accuracy | 41.8 | 61
Long-tail Image Classification | iNaturalist 2018 (test) | Accuracy (Few) | 73.8 | 59
Long-tailed Visual Recognition | iNaturalist 2018 (test) | Accuracy | 74.9 | 39
Showing 10 of 30 rows
