Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Differentially Private Contrastive Learning via Bounding Group-level Contribution

About

Differentially private (DP) contrastive learning aims to learn general-purpose representations from sensitive data, alleviating the privacy leakage concerns of organizations deploying or sharing embedding models trained on private user content. However, existing approaches suffer from severe utility degradation due to the over-strong inter-sample dependency inherent in standard contrastive objectives, where each sample's gradient depends on all other samples in the batch, amplifying the impact of DP noise. In this work, we argue that effective DP contrastive learning requires explicitly reducing such intrinsic inter-sample reliance. To this end, we propose DP-GCL, a principled DP contrastive learning framework that structurally limits gradient dependency through bounding group-level contribution. DP-GCL partitions each batch into small, disjoint groups and restricts available negative samples to within-group samples, thereby localizing gradient influence and reducing sensitivity. To counteract the resulting loss of negative sample diversity, we further introduce intra-group augmentation, which generates additional negative views without increasing privacy cost. Extensive experiments across eight datasets demonstrate that DP-GCL consistently advances the state of the art in both uni-modal and multi-modal contrastive learning under practical privacy budgets: it improves image classification accuracy by 5.6% and image-text retrieval accuracy by 20.1% over existing DP contrastive methods.

Kecen Li, Chen Gong, Zinan Lin, Tianhao Wang, Xiaokui Xiao• 2026

Related benchmarks

TaskDatasetResultRank
ClassificationCIFAR10 (test)
Accuracy42.3
331
Image ClassificationF-MNIST (test)
Accuracy82.9
156
Text-to-Image RetrievalCUHK-PEDES (test)
Recall@148.1
114
ClassificationEuroSAT
Top-1 Accuracy53.5
26
ClassificationEuroSAT (test)
Top-1 Acc59.1
24
Image-to-Text RetrievalCUHK-PEDES (test)--
24
ClassificationCAMELYON (test)
Accuracy78.4
20
ClassificationCAMELYON
Accuracy69.8
20
Text-to-image person retrievalRSTPReid (test)--
17
Image-to-Text RetrievalROCO (test)
Retrieval Accuracy37.5
10
Showing 10 of 14 rows

Other info

Follow for update