Long-tailed Recognition by Routing Diverse Distribution-Aware Experts
About
Natural data are often long-tail distributed over semantic classes. Existing recognition methods tackle this imbalanced classification by placing more emphasis on the tail data, through class re-balancing/re-weighting or ensembling over different data groups, resulting in increased tail accuracies but reduced head accuracies. We take a dynamic view of the training data and provide a principled model bias and variance analysis as the training data fluctuates: Existing long-tail classifiers invariably increase the model variance and the head-tail model bias gap remains large, due to more and larger confusion with hard negatives for the tail. We propose a new long-tailed classifier called RoutIng Diverse Experts (RIDE). It reduces the model variance with multiple experts, reduces the model bias with a distribution-aware diversity loss, reduces the computational cost with a dynamic expert routing module. RIDE outperforms the state-of-the-art by 5% to 7% on CIFAR100-LT, ImageNet-LT and iNaturalist 2018 benchmarks. It is also a universal framework that is applicable to various backbone networks, long-tailed algorithms, and training mechanisms for consistent performance gains. Our code is available at: https://github.com/frank-xwang/RIDE-LongTailRecognition.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | iNaturalist 2018 | Top-1 Accuracy72.2 | 287 | |
| Image Classification | ImageNet LT | Top-1 Accuracy56.8 | 251 | |
| Long-Tailed Image Classification | ImageNet-LT (test) | Top-1 Acc (Overall)60.2 | 220 | |
| Image Classification | iNaturalist 2018 (test) | Top-1 Accuracy72.6 | 192 | |
| Image Classification | CIFAR-10-LT (test) | -- | 185 | |
| Image Classification | ImageNet-LT (test) | Top-1 Acc (All)56.8 | 159 | |
| Image Classification | CIFAR-100 Long-Tailed (test) | Top-1 Accuracy59 | 149 | |
| Image Classification | iNaturalist 2018 (val) | Top-1 Accuracy72.6 | 116 | |
| Long-tailed Visual Recognition | ImageNet LT | Overall Accuracy56.8 | 89 | |
| Image Classification | CIFAR-100-LT Imbalance Ratio 100 | Top-1 Acc0.504 | 88 |