Balanced Contrastive Learning for Long-Tailed Visual Recognition

About

Real-world data typically follow a long-tailed distribution, where a few majority categories occupy most of the data while most minority categories contain a limited number of samples. Classification models minimizing cross-entropy struggle to represent and classify the tail classes. Although the problem of learning unbiased classifiers has been well studied, methods for representing imbalanced data are under-explored. In this paper, we focus on representation learning for imbalanced data. Recently, supervised contrastive learning has shown promising performance on balanced data recently. However, through our theoretical analysis, we find that for long-tailed data, it fails to form a regular simplex which is an ideal geometric configuration for representation learning. To correct the optimization behavior of SCL and further improve the performance of long-tailed visual recognition, we propose a novel loss for balanced contrastive learning (BCL). Compared with SCL, we have two improvements in BCL: class-averaging, which balances the gradient contribution of negative classes; class-complement, which allows all classes to appear in every mini-batch. The proposed balanced contrastive learning (BCL) method satisfies the condition of forming a regular simplex and assists the optimization of cross-entropy. Equipped with BCL, the proposed two-branch framework can obtain a stronger feature representation and achieve competitive performance on long-tailed benchmark datasets such as CIFAR-10-LT, CIFAR-100-LT, ImageNet-LT, and iNaturalist2018. Our code is available at https://github.com/FlamieZhu/BCL .

Jianggang Zhu, Zheng Wang, Jingjing Chen, Yi-Ping Phoebe Chen, Yu-Gang Jiang• 2022

Related benchmarks

Task	Dataset	Result
Image Classification	iNaturalist 2018	Top-1 Accuracy71.8	291
Image Classification	ImageNet LT	Top-1 Accuracy57.2	264
Long-Tailed Image Classification	ImageNet-LT (test)	Top-1 Acc (Overall)57.1	246
Image Classification	iNaturalist 2018 (test)	Top-1 Accuracy71.8	223
Image Classification	CIFAR-10 long-tailed (test)	--	211
Image Classification	ImageNet-LT (test)	Top-1 Acc (All)58.6	159
Image Classification	CIFAR-100 LT	Top-1 Acc64.87	131
Image Classification	Places-LT (test)	--	128
Image Classification	CIFAR-10-LT	Top-1 Accuracy91.1	127
Image Classification	iNaturalist 2018 (val)	Top-1 Accuracy71.8	116

Showing 10 of 67 rows

Other info

Follow for update

@wizwand_team Discord