Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Weakly Supervised Contrastive Learning

About

Unsupervised visual representation learning has gained much attention from the computer vision community because of the recent achievement of contrastive learning. Most of the existing contrastive learning frameworks adopt the instance discrimination as the pretext task, which treating every single instance as a different class. However, such method will inevitably cause class collision problems, which hurts the quality of the learned representation. Motivated by this observation, we introduced a weakly supervised contrastive learning framework (WCL) to tackle this issue. Specifically, our proposed framework is based on two projection heads, one of which will perform the regular instance discrimination task. The other head will use a graph-based method to explore similar samples and generate a weak label, then perform a supervised contrastive learning task based on the weak label to pull the similar images closer. We further introduced a K-Nearest Neighbor based multi-crop strategy to expand the number of positive samples. Extensive experimental results demonstrate WCL improves the quality of self-supervised representations across different datasets. Notably, we get a new state-of-the-art result for semi-supervised learning. With only 1\% and 10\% labeled examples, WCL achieves 65\% and 72\% ImageNet Top-1 Accuracy using ResNet50, which is even higher than SimCLRv2 with ResNet101.

Mingkai Zheng, Fei Wang, Shan You, Chen Qian, Changshui Zhang, Xiaogang Wang, Chang Xu• 2021

Related benchmarks

TaskDatasetResultRank
Image ClassificationImageNet-1k (val)
Top-1 Accuracy74.7
1469
ClassificationImageNet-1K 1.0 (val)
Top-1 Accuracy (%)73.3
1163
Semantic Textual SimilaritySTS tasks (STS12, STS13, STS14, STS15, STS16, STS-B, SICK-R) various (test)
STS12 Score34.81
412
Image ClassificationImageNet
Top-1 Accuracy74.7
366
ClassificationPASCAL VOC 2007 (test)
mAP (%)87.8
217
3D Semantic SegmentationScanNet V2 (val)
mIoU69.2
209
Image ClassificationImageNet (val)
Top-1 Accuracy74.7
125
Image ClassificationImageNet 1% labeled
Top-5 Accuracy86.3
118
Image ClassificationImageNet 1k (10% labels)
Top-1 Acc72
92
ClassificationImageNet-1k (val)
Top-1 Acc72.2
37
Showing 10 of 23 rows

Other info

Follow for update