Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Identifying and Compensating for Feature Deviation in Imbalanced Deep Learning

About

Classifiers trained with class-imbalanced data are known to perform poorly on test data of the "minor" classes, of which we have insufficient training data. In this paper, we investigate learning a ConvNet classifier under such a scenario. We found that a ConvNet significantly over-fits the minor classes, which is quite opposite to traditional machine learning algorithms that often under-fit minor classes. We conducted a series of analysis and discovered the feature deviation phenomenon -- the learned ConvNet generates deviated features between the training and test data of minor classes -- which explains how over-fitting happens. To compensate for the effect of feature deviation which pushes test data toward low decision value regions, we propose to incorporate class-dependent temperatures (CDT) in training a ConvNet. CDT simulates feature deviation in the training phase, forcing the ConvNet to enlarge the decision values for minor-class data so that it can overcome real feature deviation in the test phase. We validate our approach on benchmark datasets and achieve promising performance. We hope that our insights can inspire new ways of thinking in resolving class-imbalanced deep learning.

Han-Jia Ye, Hong-You Chen, De-Chuan Zhan, Wei-Lun Chao• 2020

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-100 LT (val)--
69
Image ClassificationCIFAR-10-LT (val)--
65
Image ClassificationCIFAR100 LT
Balanced Accuracy57.26
57
Image ClassificationImageNet-LT 1.0 (test)--
37
Image ClassificationCIFAR10-LT imbalance factor 100 1.0 (test)
Balanced Error20.73
9
Image ClassificationCIFAR100-LT imbalance factor 100 1.0 (test)
Balanced Error57.26
9
Image ClassificationCIFAR-10-LT
Balanced Accuracy20.73
9
Showing 7 of 7 rows

Other info

Follow for update