Towards Realistic Semi-Supervised Learning

About

Deep learning is pushing the state-of-the-art in many computer vision applications. However, it relies on large annotated data repositories, and capturing the unconstrained nature of the real-world data is yet to be solved. Semi-supervised learning (SSL) complements the annotated training data with a large corpus of unlabeled data to reduce annotation cost. The standard SSL approach assumes unlabeled data are from the same distribution as annotated data. Recently, a more realistic SSL problem, called open-world SSL, is introduced, where the unannotated data might contain samples from unknown classes. In this paper, we propose a novel pseudo-label based approach to tackle SSL in open-world setting. At the core of our method, we utilize sample uncertainty and incorporate prior knowledge about class distribution to generate reliable class-distribution-aware pseudo-labels for unlabeled data belonging to both known and unknown classes. Our extensive experimentation showcases the effectiveness of our approach on several benchmark datasets, where it substantially outperforms the existing state-of-the-art on seven diverse datasets including CIFAR-100 (~17%), ImageNet-100 (~5%), and Tiny ImageNet (~9%). We also highlight the flexibility of our approach in solving novel class discovery task, demonstrate its stability in dealing with imbalanced data, and complement our approach with a technique to estimate the number of novel classes

Mamshad Nayeem Rizve, Navid Kardan, Mubarak Shah• 2022

Related benchmarks

Task	Dataset	Result
Image Classification	FGVC-Aircraft (test)	Accuracy55.4	322
Image Classification	Oxford-IIIT Pet (test)	Overall Accuracy53.9	59
Open-world semi-supervised learning	CIFAR-100 (test)	Overall Accuracy64.7	40
Long-tailed Image Recognition	CIFAR-10 long-tailed (test)	Accuracy (New Classes)66.8	39
Novel Class Discovery	CIFAR-100	ACC (Seen)0.685	27
Open-world semi-supervised learning	CIFAR-10	Accuracy (Seen)96.8	25
Open-world semi-supervised learning	CIFAR-100 50% known, 50% novel	Known Accuracy68.5	16
Image Classification	CIFAR-10	Accuracy (Seen)96.8	14
Image Classification	CIFAR-100	Seen Class Accuracy80	14
Generalized Novel Class Discovery	Tiny ImageNet (test)	Seen Accuracy59.1	13

Showing 10 of 21 rows

Other info

Code

Follow for update

@wizwand_team Discord