Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Dash: Semi-Supervised Learning with Dynamic Thresholding

About

While semi-supervised learning (SSL) has received tremendous attentions in many machine learning tasks due to its successful use of unlabeled data, existing SSL algorithms use either all unlabeled examples or the unlabeled examples with a fixed high-confidence prediction during the training progress. However, it is possible that too many correct/wrong pseudo labeled examples are eliminated/selected. In this work we develop a simple yet powerful framework, whose key idea is to select a subset of training examples from the unlabeled data when performing existing SSL methods so that only the unlabeled examples with pseudo labels related to the labeled data will be used to train models. The selection is performed at each updating iteration by only keeping the examples whose losses are smaller than a given threshold that is dynamically adjusted through the iteration. Our proposed approach, Dash, enjoys its adaptivity in terms of unlabeled data selection and its theoretical guarantee. Specifically, we theoretically establish the convergence rate of Dash from the view of non-convex optimization. Finally, we empirically demonstrate the effectiveness of the proposed method in comparison with state-of-the-art over benchmarks.

Yi Xu, Lei Shang, Jinxing Ye, Qi Qian, Yu-Feng Li, Baigui Sun, Hao Li, Rong Jin• 2021

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-100 (test)--
3518
Image ClassificationCIFAR-10 (test)--
3381
Image ClassificationCIFAR-100--
622
Image ClassificationCIFAR10 (test)--
585
Image ClassificationCIFAR-10--
507
Image ClassificationSVHN
Accuracy97
359
Image ClassificationSTL-10 (test)
Accuracy41.06
357
Image ClassificationCIFAR-100 standard (test)--
133
Image ClassificationSTL-10
Top-1 Accuracy64.5
128
Image ClassificationCIFAR-10 40 labels
Error Rate8.93
81
Showing 10 of 78 rows
...

Other info

Code

Follow for update