Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Balanced Contrastive Learning for Long-Tailed Visual Recognition

About

Real-world data typically follow a long-tailed distribution, where a few majority categories occupy most of the data while most minority categories contain a limited number of samples. Classification models minimizing cross-entropy struggle to represent and classify the tail classes. Although the problem of learning unbiased classifiers has been well studied, methods for representing imbalanced data are under-explored. In this paper, we focus on representation learning for imbalanced data. Recently, supervised contrastive learning has shown promising performance on balanced data recently. However, through our theoretical analysis, we find that for long-tailed data, it fails to form a regular simplex which is an ideal geometric configuration for representation learning. To correct the optimization behavior of SCL and further improve the performance of long-tailed visual recognition, we propose a novel loss for balanced contrastive learning (BCL). Compared with SCL, we have two improvements in BCL: class-averaging, which balances the gradient contribution of negative classes; class-complement, which allows all classes to appear in every mini-batch. The proposed balanced contrastive learning (BCL) method satisfies the condition of forming a regular simplex and assists the optimization of cross-entropy. Equipped with BCL, the proposed two-branch framework can obtain a stronger feature representation and achieve competitive performance on long-tailed benchmark datasets such as CIFAR-10-LT, CIFAR-100-LT, ImageNet-LT, and iNaturalist2018. Our code is available at https://github.com/FlamieZhu/BCL .

Jianggang Zhu, Zheng Wang, Jingjing Chen, Yi-Ping Phoebe Chen, Yu-Gang Jiang• 2022

Related benchmarks

TaskDatasetResultRank
Image ClassificationiNaturalist 2018
Top-1 Accuracy71.8
287
Image ClassificationImageNet LT
Top-1 Accuracy57.2
251
Long-Tailed Image ClassificationImageNet-LT (test)
Top-1 Acc (Overall)57.1
220
Image ClassificationCIFAR-10 long-tailed (test)--
201
Image ClassificationiNaturalist 2018 (test)
Top-1 Accuracy71.1
192
Image ClassificationImageNet-LT (test)
Top-1 Acc (All)58.6
159
Image ClassificationPlaces-LT (test)--
128
Image ClassificationiNaturalist 2018 (val)
Top-1 Accuracy71.8
116
Long-tailed Visual RecognitionImageNet LT
Overall Accuracy56.7
89
Image ClassificationCIFAR-100-LT Imbalance Ratio 100
Top-1 Acc0.519
88
Showing 10 of 63 rows

Other info

Follow for update