Class-Balanced Distillation for Long-Tailed Visual Recognition

About

Real-world imagery is often characterized by a significant imbalance in the number of images per class, leading to long-tailed distributions. A simple and effective approach to long-tailed visual recognition is to learn the feature representation and the classifier separately, with instance sampling and class-balanced sampling, respectively. In this work, we introduce a new framework based on the key observation that a feature representation learned with instance sampling is far from optimal in a long-tailed setting. Our main contribution is a new training method, referred to as Class-Balanced Distillation (CBD), that leverages knowledge distillation to enhance feature representations. CBD allows the feature representation to evolve in the second training stage, guided by the teacher learned in the first stage. The second stage uses class-balanced sampling in order to focus on under-represented classes. This framework naturally accommodates multiple teachers, unlocking the information from an ensemble of models to enhance recognition capabilities. Our experiments show that the proposed technique consistently outperforms the state of the art on long-tailed recognition benchmarks such as ImageNet-LT, iNaturalist17 and iNaturalist18.

Ahmet Iscen, André Araujo, Boqing Gong, Cordelia Schmid • 2021
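
The two sampling schemes and the second-stage objective described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not state the form of the distillation term or how it is weighted, so the cosine distance between student and teacher features, the `alpha` weight, and all function names below are assumptions made for illustration only.

```python
import math
from collections import Counter


def instance_sampling_probs(labels):
    """Per-class draw probability when every image is equally likely."""
    counts = Counter(labels)
    total = len(labels)
    return {c: n / total for c, n in counts.items()}


def class_balanced_probs(labels):
    """Per-class draw probability when every class is equally likely."""
    classes = set(labels)
    return {c: 1.0 / len(classes) for c in classes}


def cosine_distance(u, v):
    """1 - cosine similarity between two feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


def cross_entropy(logits, target):
    """Softmax cross-entropy for one example (numerically stable)."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_z - logits[target]


def second_stage_loss(student_feat, teacher_feat, logits, target, alpha=0.5):
    """Hypothetical second-stage objective: classification loss plus a
    feature-distillation term pulling the student toward the teacher.
    The cosine form and the alpha weighting are assumptions."""
    ce = cross_entropy(logits, target)
    distill = cosine_distance(student_feat, teacher_feat)
    return (1.0 - alpha) * ce + alpha * distill


# Toy long-tailed label set: 90 head, 9 mid, 1 tail image.
labels = ["head"] * 90 + ["mid"] * 9 + ["tail"] * 1
stage1_probs = instance_sampling_probs(labels)  # teacher stage
stage2_probs = class_balanced_probs(labels)     # student stage
```

On this toy label set, instance sampling draws the tail class with probability 1/100, while class-balanced sampling draws it with probability 1/3, which is why the second stage puts more weight on under-represented classes.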

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| Long-Tailed Image Classification | ImageNet-LT (test) | -- | 220 |
| Image Classification | iNaturalist 2018 (test) | -- | 192 |
| Image Classification | ImageNet-LT (test) | Top-1 Acc (All): 55.6 | 159 |
| Long-tailed Visual Recognition | ImageNet LT | Overall Accuracy: 57.7 | 89 |
| Long-Tailed Image Classification | iNaturalist 2018 | Accuracy: 73.6 | 82 |
| Image Classification | CIFAR-100-LT Imbalance Ratio 100 (test) | Accuracy: 44.83 | 62 |
| Image Classification | CIFAR-100 Imbalance Ratio LT-50 (test) | Accuracy: 49.19 | 62 |
| Image Classification | CIFAR-100 LT Imbalance Ratio 10 (test) | Accuracy: 60.85 | 59 |
| Long-tailed Visual Recognition | iNaturalist 2018 (test) | Accuracy: 75.3 | 39 |
| Long-tailed Visual Recognition | iNaturalist 2017 (test) | Accuracy: 71.3 | 16 |
Showing 10 of 13 rows
