Nested Collaborative Learning for Long-Tailed Visual Recognition

About

The networks trained on the long-tailed dataset vary remarkably, despite the same training settings, which shows the great uncertainty in long-tailed learning. To alleviate the uncertainty, we propose a Nested Collaborative Learning (NCL), which tackles the problem by collaboratively learning multiple experts together. NCL consists of two core components, namely Nested Individual Learning (NIL) and Nested Balanced Online Distillation (NBOD), which focus on the individual supervised learning for each single expert and the knowledge transferring among multiple experts, respectively. To learn representations more thoroughly, both NIL and NBOD are formulated in a nested way, in which the learning is conducted on not just all categories from a full perspective but some hard categories from a partial perspective. Regarding the learning in the partial perspective, we specifically select the negative categories with high predicted scores as the hard categories by using a proposed Hard Category Mining (HCM). In the NCL, the learning from two perspectives is nested, highly related and complementary, and helps the network to capture not only global and robust features but also meticulous distinguishing ability. Moreover, self-supervision is further utilized for feature enhancement. Extensive experiments manifest the superiority of our method with outperforming the state-of-the-art whether by using a single model or an ensemble.

Jun Li, Zichang Tan, Jun Wan, Zhen Lei, Guodong Guo• 2022

Related benchmarks

Task	Dataset	Result
Long-Tailed Image Classification	ImageNet-LT (test)	Top-1 Acc (Overall)60.5	246
Image Classification	ImageNet-LT (test)	Top-1 Acc (All)57.4	159
Image Classification	Places-LT (test)	--	128
Image Classification	CIFAR-100-LT Imbalance Ratio 100	Top-1 Acc0.542	88
Image Classification	CIFAR-10-LT (IF 50)	Top-1 Accuracy82.9	88
Image Classification	CIFAR-100 LT (IF=50)	Top-1 Acc47.4	82
Image Classification	CIFAR-10-LT IF 100	Top-1 Accuracy79.7	78
Image Classification	CIFAR-100-LT IF 100 (test)	Top-1 Acc54.2	77
Long-Tailed Image Classification	Places-LT (test)	Accuracy41.8	74
Long-tailed recognition	Places-LT (test)	Accuracy (Overall)41.8	71

Showing 10 of 30 rows

Other info

Code

Follow for update

@wizwand_team Discord