Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

OpenMatch: Open-set Consistency Regularization for Semi-supervised Learning with Outliers

About

Semi-supervised learning (SSL) is an effective means to leverage unlabeled data to improve a model's performance. Typical SSL methods like FixMatch assume that labeled and unlabeled data share the same label space. However, in practice, unlabeled data can contain categories unseen in the labeled set, i.e., outliers, which can significantly harm the performance of SSL algorithms. To address this problem, we propose a novel Open-set Semi-Supervised Learning (OSSL) approach called OpenMatch. Learning representations of inliers while rejecting outliers is essential for the success of OSSL. To this end, OpenMatch unifies FixMatch with novelty detection based on one-vs-all (OVA) classifiers. The OVA-classifier outputs the confidence score of a sample being an inlier, providing a threshold to detect outliers. Another key contribution is an open-set soft-consistency regularization loss, which enhances the smoothness of the OVA-classifier with respect to input transformations and greatly improves outlier detection. OpenMatch achieves state-of-the-art performance on three datasets, and even outperforms a fully supervised model in detecting outliers unseen in unlabeled data on CIFAR10.

Kuniaki Saito, Donghyun Kim, Kate Saenko• 2021

Related benchmarks

TaskDatasetResultRank
Out-of-Distribution DetectionCIFAR100 (test)
AUROC95.8
57
Out-of-Distribution DetectionSVHN (test)
AUROC0.93
48
Anomaly DetectionImageNet (test)
AUC0.987
35
Open Set RecognitionCIFAR10 6 closed, 4 open classes 1.0
AUROC0.997
30
Out-of-Distribution DetectionCIFAR10 (test)--
22
Open-set Semi-Supervised LearningCIFAR-10 6/4 known-unknown class split
Accuracy94.1
15
Open Set RecognitionCIFAR100 55 known / 45 unknown
AUROC0.87
8
Open Set RecognitionCIFAR100 80 known / 20 unknown
AUROC86.8
8
Open-set Semi-Supervised ClassificationCIFAR100 (55/45 split)
Error Rate24.1
8
Open-set Semi-Supervised ClassificationCIFAR100 (80/20 split)
Error Rate29.5
8
Showing 10 of 14 rows

Other info

Code

Follow for update