Repetitive Reprediction Deep Decipher for Semi-Supervised Learning
About
Most recent semi-supervised deep learning (deep SSL) methods used a similar paradigm: use network predictions to update pseudo-labels and use pseudo-labels to update network parameters iteratively. However, they lack theoretical support and cannot explain why predictions are good candidates for pseudo-labels. In this paper, we propose a principled end-to-end framework named deep decipher (D2) for SSL. Within the D2 framework, we prove that pseudo-labels are related to network predictions by an exponential link function, which gives a theoretical support for using predictions as pseudo-labels. Furthermore, we demonstrate that updating pseudo-labels by network predictions will make them uncertain. To mitigate this problem, we propose a training strategy called repetitive reprediction (R2). Finally, the proposed R2-D2 method is tested on the large-scale ImageNet dataset and outperforms state-of-the-art methods by 5 percentage points.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | SVHN 1000 labels (test) | Error Rate3.64 | 69 | |
| Image Classification | CIFAR-10 4,000 labels (test) | -- | 57 | |
| Image Classification | ImageNet 10% labels 1K (val) | Top-5 Error19.52 | 18 | |
| Image Classification | CIFAR-100 4000 labels standard | Error Rate32.77 | 5 | |
| Image Classification | CIFAR-100 10000 labels standard | Error Rate32.87 | 5 |