ABC: Auxiliary Balanced Classifier for Class-imbalanced Semi-supervised Learning
About
Existing semi-supervised learning (SSL) algorithms typically assume class-balanced datasets, although the class distributions of many real-world datasets are imbalanced. In general, classifiers trained on a class-imbalanced dataset are biased toward the majority classes. This issue becomes more problematic for SSL algorithms because they utilize the biased prediction of unlabeled data for training. However, traditional class-imbalanced learning techniques, which are designed for labeled data, cannot be readily combined with SSL algorithms. We propose a scalable class-imbalanced SSL algorithm that can effectively use unlabeled data, while mitigating class imbalance by introducing an auxiliary balanced classifier (ABC) of a single layer, which is attached to a representation layer of an existing SSL algorithm. The ABC is trained with a class-balanced loss of a minibatch, while using high-quality representations learned from all data points in the minibatch using the backbone SSL algorithm to avoid overfitting and information loss.Moreover, we use consistency regularization, a recent SSL technique for utilizing unlabeled data in a modified way, to train the ABC to be balanced among the classes by selecting unlabeled data with the same probability for each class. The proposed algorithm achieves state-of-the-art performance in various class-imbalanced SSL experiments using four benchmark datasets.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | CIFAR100 long-tailed (test) | Accuracy59.1 | 155 | |
| Classification | CIFAR100-LT (test) | Accuracy62.4 | 136 | |
| Image Classification | CIFAR10 long-tailed (test) | Accuracy83.8 | 68 | |
| Image Classification | CIFAR10 LT (test) | Accuracy83.8 | 68 | |
| Image Classification | CIFAR100 LT | Balanced Accuracy59.1 | 57 | |
| Image Classification | CIFAR-100 Long-Tailed (test) | Balanced Accuracy55.6 | 51 | |
| Image Classification | CIFAR10-LT | Accuracy83.8 | 48 | |
| Image Classification | SVHN-LT (test) | -- | 40 | |
| Image Classification | CIFAR-10-LT gamma=100 (test) | Accuracy85.2 | 35 | |
| Semi-supervised Long-tailed Image Classification | STL10 LT | Accuracy77.1 | 24 |