Balanced Meta-Softmax for Long-Tailed Visual Recognition

About

Deep classifiers have achieved great success in visual recognition. However, real-world data is long-tailed by nature, leading to the mismatch between training and testing distributions. In this paper, we show that the Softmax function, though used in most classification tasks, gives a biased gradient estimation under the long-tailed setup. This paper presents Balanced Softmax, an elegant unbiased extension of Softmax, to accommodate the label distribution shift between training and testing. Theoretically, we derive the generalization bound for multiclass Softmax regression and show our loss minimizes the bound. In addition, we introduce Balanced Meta-Softmax, applying a complementary Meta Sampler to estimate the optimal class sample rate and further improve long-tailed learning. In our experiments, we demonstrate that Balanced Meta-Softmax outperforms state-of-the-art long-tailed classification solutions on both visual recognition and instance segmentation tasks.

Jiawei Ren, Cunjun Yu, Shunan Sheng, Xiao Ma, Haiyu Zhao, Shuai Yi, Hongsheng Li• 2020

Related benchmarks

Task	Dataset	Result
Object Detection	LVIS v1.0 (val)	APbbox26.9	542
Image Classification	iNaturalist 2018	Top-1 Accuracy71.8	291
Image Classification	PACS	Overall Average Accuracy67.6	270
Image Classification	ImageNet LT	Top-1 Accuracy55.4	264
Long-Tailed Image Classification	ImageNet-LT (test)	Top-1 Acc (Overall)58	246
Image Classification	CIFAR-100 Long-Tailed (test)	Top-1 Accuracy63	234
Image Classification	iNaturalist 2018 (test)	Top-1 Accuracy71.8	223
Image Classification	CIFAR-10 long-tailed (test)	Top-1 Acc91.3	211
Image Classification	ImageNet-LT (test)	Top-1 Acc (All)57.1	159
Skeleton-based Action Recognition	NTU 120 (X-sub)	Accuracy80.7	139

Showing 10 of 146 rows

...

Other info

Code

Follow for update

@wizwand_team Discord