Towards Domain-Agnostic Contrastive Learning
About
Despite recent success, most contrastive self-supervised learning methods are domain-specific, relying heavily on data augmentation techniques that require knowledge about a particular domain, such as image cropping and rotation. To overcome such limitation, we propose a novel domain-agnostic approach to contrastive learning, named DACL, that is applicable to domains where invariances, and thus, data augmentation techniques, are not readily available. Key to our approach is the use of Mixup noise to create similar and dissimilar examples by mixing data samples differently either at the input or hidden-state levels. To demonstrate the effectiveness of DACL, we conduct experiments across various domains such as tabular data, images, and graphs. Our results show that DACL not only outperforms other domain-agnostic noising methods, such as Gaussian-noise, but also combines well with domain-specific methods, such as SimCLR, to improve self-supervised visual representation learning. Finally, we theoretically analyze our method and show advantages over the Gaussian-noise based contrastive learning approach.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Object Detection | PASCAL VOC 2007 (test) | mAP57.1 | 821 | |
| Image Classification | ImageNet-1k (val) | Top-1 Acc72.3 | 706 | |
| Image Classification | CIFAR-10 | Accuracy94.4 | 471 | |
| Object Detection | MS-COCO 2017 (val) | -- | 237 | |
| Linear Classification | ImageNet-1K 1.0 (val) | Top-1 Accuracy72.9 | 48 | |
| Activity Recognition | HHAR (test) | Mean F1 Score81.31 | 46 | |
| Activity Recognition | UCIHAR (test) | Macro F1 Score66.28 | 43 | |
| Heart-rate prediction | IEEE SPC12 | MAE21.85 | 31 | |
| Heart-rate prediction | IEEE SPC 22 | MAE14.67 | 12 | |
| Heart-rate prediction | DaLia | MAE18.44 | 12 |