Balanced Meta-Softmax for Long-Tailed Visual Recognition
About
Deep classifiers have achieved great success in visual recognition. However, real-world data is long-tailed by nature, leading to the mismatch between training and testing distributions. In this paper, we show that the Softmax function, though used in most classification tasks, gives a biased gradient estimation under the long-tailed setup. This paper presents Balanced Softmax, an elegant unbiased extension of Softmax, to accommodate the label distribution shift between training and testing. Theoretically, we derive the generalization bound for multiclass Softmax regression and show our loss minimizes the bound. In addition, we introduce Balanced Meta-Softmax, applying a complementary Meta Sampler to estimate the optimal class sample rate and further improve long-tailed learning. In our experiments, we demonstrate that Balanced Meta-Softmax outperforms state-of-the-art long-tailed classification solutions on both visual recognition and instance segmentation tasks.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Object Detection | LVIS v1.0 (val) | APbbox26.9 | 518 | |
| Image Classification | iNaturalist 2018 | Top-1 Accuracy71.8 | 287 | |
| Image Classification | ImageNet LT | Top-1 Accuracy55.4 | 251 | |
| Image Classification | PACS | Overall Average Accuracy67.6 | 230 | |
| Long-Tailed Image Classification | ImageNet-LT (test) | Top-1 Acc (Overall)58 | 220 | |
| Image Classification | CIFAR-10 long-tailed (test) | Top-1 Acc91.3 | 201 | |
| Image Classification | iNaturalist 2018 (test) | Top-1 Accuracy71.8 | 192 | |
| Image Classification | ImageNet-LT (test) | Top-1 Acc (All)57.1 | 159 | |
| Image Classification | CIFAR-100 Long-Tailed (test) | Top-1 Accuracy63 | 149 | |
| Skeleton-based Action Recognition | NTU 120 (X-sub) | Accuracy80.7 | 139 |