SynCo: Synthetic Hard Negatives for Contrastive Visual Representation Learning

About

Contrastive learning has become a dominant approach in self-supervised visual representation learning, but efficiently leveraging hard negatives, which are samples closely resembling the anchor, remains challenging. We introduce SynCo (Synthetic negatives in Contrastive learning), a novel approach that improves model performance by generating synthetic hard negatives on the representation space. Building on the MoCo framework, SynCo introduces six strategies for creating diverse synthetic hard negatives on-the-fly with minimal computational overhead. SynCo achieves faster training and strong representation learning, surpassing MoCo-v2 by +0.4% and MoCHI by +1.0% on ImageNet ILSVRC-2012 linear evaluation. It also transfers more effectively to detection tasks achieving strong results on PASCAL VOC detection (57.2% AP) and significantly improving over MoCo-v2 on COCO detection (+1.0% AP) and instance segmentation (+0.8% AP). Our synthetic hard negative generation approach significantly enhances visual representations learned through self-supervised contrastive learning.

Nikolaos Giakoumoglou, Tania Stathaki• 2024

Related benchmarks

Task	Dataset	Result
Object Detection	COCO 2017 (val)	--	2843
Instance Segmentation	COCO 2017 (val)	--	1275
Image Classification	ImageNet ILSVRC-2012 (val)	Top-1 Accuracy70.7	441
Object Detection	PASCAL VOC 2007+2012 (test)	mAP (mean Average Precision)57.2	95
Semi-supervised Image Classification	ImageNet (10% labeled)	Top-1 Accuracy66.6	32
Semi-supervised Image Classification	ImageNet ILSVRC 2012 (1% labels)	Top-1 Acc50.8	9

Showing 6 of 6 rows

Other info

Code

Follow for update

@wizwand_team Discord