Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

A Self-Supervised Descriptor for Image Copy Detection

About

Image copy detection is an important task for content moderation. We introduce SSCD, a model that builds on a recent self-supervised contrastive training objective. We adapt this method to the copy detection task by changing the architecture and training objective, including a pooling operator from the instance matching literature, and adapting contrastive learning to augmentations that combine images. Our approach relies on an entropy regularization term, promoting consistent separation between descriptor vectors, and we demonstrate that this significantly improves copy detection accuracy. Our method produces a compact descriptor vector, suitable for real-world web scale applications. Statistical information from a background image distribution can be incorporated into the descriptor. On the recent DISC2021 benchmark, SSCD is shown to outperform both baseline copy detection models and self-supervised architectures designed for image classification by huge margins, in all settings. For example, SSCD out-performs SimCLR descriptors by 48% absolute. Code is available at https://github.com/facebookresearch/sscd-copy-detection.

Ed Pizzi, Sreya Dutta Roy, Sugosh Nagavara Ravindra, Priya Goyal, Matthijs Douze• 2022

Related benchmarks

TaskDatasetResultRank
Image Copy DetectionDeepFloyd IF (test)
Average Similarity30.3
28
Image Copy DetectionSDXL (test)
Avg Similarity23.9
28
Image Copy DetectionDALL-E 2 (test)
Average Similarity0.18
28
Image Copy DetectionNew Bing (test)
Average Similarity0.166
28
Image Copy DetectionGLIDE (test)
Average Similarity0.266
28
Image Copy DetectionSD 1.5 (test)
Average Similarity0.116
28
Image Copy DetectionMidjourney (test)
Average Similarity18.1
28
Copy detectionINRIA Copydays strong 10k YFCC100M distractors
mAP98.1
25
ICDiffD-Rep (test)
PCC29.1
20
Image Copy DetectionDISC 2021 (val)
µAP63.7
14
Showing 10 of 16 rows

Other info

Code

Follow for update