Class-Balanced Distillation for Long-Tailed Visual Recognition

About

Real-world imagery is often characterized by a significant imbalance in the number of images per class, leading to long-tailed distributions. A simple and effective approach to long-tailed visual recognition is to learn the feature representation and the classifier separately, with instance sampling and class-balanced sampling, respectively. In this work, we introduce a new framework based on the key observation that a feature representation learned with instance sampling is far from optimal in a long-tailed setting. Our main contribution is a new training method, Class-Balanced Distillation (CBD), that leverages knowledge distillation to enhance feature representations. CBD allows the feature representation to evolve in the second training stage, guided by the teacher learned in the first stage. The second stage uses class-balanced sampling in order to focus on under-represented classes. The framework naturally accommodates multiple teachers, drawing on the information from an ensemble of models to improve recognition. Our experiments show that the proposed technique consistently outperforms the state of the art on long-tailed recognition benchmarks such as ImageNet-LT, iNaturalist17 and iNaturalist18.
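To make the two sampling schemes and the second-stage idea concrete, here is a minimal sketch in plain Python. It is an illustration, not the authors' implementation. The function names, the toy label list, the weight `alpha`, and the choice of cosine distance between student and teacher features are all assumptions; the abstract does not state the exact loss.

```python
import math
import random
from collections import Counter, defaultdict


def class_probabilities(labels, scheme):
    """Probability that a drawn training example belongs to each class."""
    counts = Counter(labels)
    if scheme == "instance":
        # Every image is equally likely, so a class is drawn in
        # proportion to its size (head classes dominate).
        total = len(labels)
        return {c: n / total for c, n in counts.items()}
    if scheme == "class_balanced":
        # Every class is equally likely, regardless of its size.
        return {c: 1.0 / len(counts) for c in counts}
    raise ValueError(f"unknown scheme: {scheme}")


def class_balanced_sample(labels, k, rng):
    """Draw k example indices: pick a class uniformly, then an image in it."""
    by_class = defaultdict(list)
    for idx, c in enumerate(labels):
        by_class[c].append(idx)
    classes = sorted(by_class)
    return [rng.choice(by_class[rng.choice(classes)]) for _ in range(k)]


def cosine_distance(u, v):
    """1 - cosine similarity of two non-zero feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / norm


def second_stage_loss(ce_loss, student_feat, teacher_feat, alpha):
    """Hypothetical second-stage objective (ASSUMPTION, not from the source).

    Mixes the classification loss with a feature-distillation term that
    pulls the student's features toward the frozen first-stage teacher's.
    """
    distill = cosine_distance(student_feat, teacher_feat)
    return (1.0 - alpha) * ce_loss + alpha * distill


# Toy long-tailed label list: 90 "head" images, 9 "mid", 1 "tail".
labels = ["head"] * 90 + ["mid"] * 9 + ["tail"] * 1
instance_p = class_probabilities(labels, "instance")
balanced_p = class_probabilities(labels, "class_balanced")
batch = class_balanced_sample(labels, k=8, rng=random.Random(0))
```

In this toy setup the tail class is drawn 1% of the time under instance sampling and one third of the time under class-balanced sampling, which is the shift of focus toward under-represented classes that the second stage relies on.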

Ahmet Iscen, André Araujo, Boqing Gong, Cordelia Schmid • 2021

Related benchmarks

Task	Dataset	Result
Long-Tailed Image Classification	ImageNet-LT (test)	--	246
Image Classification	iNaturalist 2018 (test)	--	223
Image Classification	ImageNet-LT (test)	Top-1 Acc (All): 55.6	159
Long-tailed Visual Recognition	ImageNet LT	Overall Accuracy: 57.7	89
Long-Tailed Image Classification	iNaturalist 2018	Accuracy: 73.6	82
Image Classification	CIFAR-100-LT Imbalance Ratio 100 (test)	Accuracy: 44.83	62
Image Classification	CIFAR-100 Imbalance Ratio LT-50 (test)	Accuracy: 49.19	62
Image Classification	CIFAR-100 LT Imbalance Ratio 10 (test)	Accuracy: 60.85	59
Long-tailed Visual Recognition	iNaturalist 2018 (test)	Accuracy: 75.3	39
Long-tailed Visual Recognition	iNaturalist 2017 (test)	Accuracy: 71.3	16

Showing 10 of 13 rows
