Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Learning Imbalanced Datasets with Label-Distribution-Aware Margin Loss

About

Deep learning algorithms can fare poorly when the training dataset suffers from heavy class-imbalance but the testing criterion requires good generalization on less frequent classes. We design two novel methods to improve performance in such scenarios. First, we propose a theoretically-principled label-distribution-aware margin (LDAM) loss motivated by minimizing a margin-based generalization bound. This loss replaces the standard cross-entropy objective during training and can be applied with prior strategies for training with class-imbalance such as re-weighting or re-sampling. Second, we propose a simple, yet effective, training schedule that defers re-weighting until after the initial stage, allowing the model to learn an initial representation while avoiding some of the complications associated with re-weighting or re-sampling. We test our methods on several benchmark vision tasks including the real-world imbalanced dataset iNaturalist 2018. Our experiments show that either of these methods alone can already improve over existing techniques and their combination achieves even better performance gains.

Kaidi Cao, Colin Wei, Adrien Gaidon, Nikos Arechiga, Tengyu Ma• 2019

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-100 (test)
Accuracy54.01
3518
Image ClassificationCIFAR-10 (test)
Accuracy85.94
3381
Fine-grained Image ClassificationCUB200 2011 (test)
Accuracy88.2
543
Fine-grained Image ClassificationStanford Cars (test)
Accuracy93.6
348
Fine-grained visual classificationFGVC-Aircraft (test)
Top-1 Acc92.9
312
Image ClassificationiNaturalist 2018
Top-1 Accuracy68
291
Image ClassificationImageNet LT
Top-1 Accuracy51.1
264
Image ClassificationPACS
Overall Average Accuracy66.6
241
Image ClassificationCIFAR-100 Long-Tailed (test)
Top-1 Accuracy62.1
234
Long-Tailed Image ClassificationImageNet-LT (test)
Top-1 Acc (Overall)49.8
220
Showing 10 of 309 rows
...

Other info

Code

Follow for update