InstanT: Semi-supervised Learning with Instance-dependent Thresholds
About
Semi-supervised learning (SSL) has been a fundamental challenge in machine learning for decades. The primary family of SSL algorithms, known as pseudo-labeling, involves assigning pseudo-labels to confident unlabeled instances and incorporating them into the training set. Therefore, the selection criteria of confident instances are crucial to the success of SSL. Recently, there has been growing interest in the development of SSL methods that use dynamic or adaptive thresholds. Yet, these methods typically apply the same threshold to all samples, or use class-dependent thresholds for instances belonging to a certain class, while neglecting instance-level information. In this paper, we propose the study of instance-dependent thresholds, which has the highest degree of freedom compared with existing methods. Specifically, we devise a novel instance-dependent threshold function for all unlabeled instances by utilizing their instance-level ambiguity and the instance-dependent error rates of pseudo-labels, so instances that are more likely to have incorrect pseudo-labels will have higher thresholds. Furthermore, we demonstrate that our instance-dependent threshold function provides a bounded probabilistic guarantee for the correctness of the pseudo-labels it assigns.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | CIFAR-10 (test) | -- | 3381 | |
| Image Classification | STL-10 (test) | Accuracy69.39 | 357 | |
| Image Classification | CIFAR-10 40 labeled samples | Top-1 Accuracy94.83 | 20 | |
| Image Classification | CIFAR-10 40 labels (test) | Top-1 Accuracy97.93 | 12 | |
| Image Classification | CIFAR-10 250 labels (test) | Top-1 Accuracy98.08 | 10 | |
| Image Classification | STL-10 40 labels (test) | Top-1 Accuracy85.09 | 10 | |
| Image Classification | CIFAR-100 200 labels (test) | Top-1 Accuracy74.17 | 10 | |
| Image Classification | CIFAR-100 400 labels (test) | Top-1 Acc78.8 | 10 | |
| Image Classification | STL-10 100 labels (test) | Top-1 Acc89.35 | 10 | |
| Image Classification | CIFAR-10 250 labeled samples | -- | 9 |