Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Producing augmentation-invariant embeddings from real-life imagery

About

This article presents an efficient way to produce feature-rich, high-dimensionality embedding spaces from real-life images. The features produced are designed to be independent from augmentations used in real-life cases which appear on social media. Our approach uses convolutional neural networks (CNN) to produce an embedding space. An ArcFace head was used to train the model by employing automatically produced augmentations. Additionally, we present a way to make an ensemble out of different embeddings containing the same semantic information, a way to normalize the resulting embedding using an external dataset, and a novel way to perform quick training of these models with a high number of classes in the ArcFace head. Using this approach we achieved the 2nd place in the 2021 Facebook AI Image Similarity Challenge: Descriptor Track.

Sergio Manuel Papadakis, Sanjay Addicam• 2021

Related benchmarks

TaskDatasetResultRank
Image Copy DetectionDeepFloyd IF (test)
Average Similarity43.8
28
Image Copy DetectionMidjourney (test)
Average Similarity26.5
28
Image Copy DetectionDALL-E 2 (test)
Average Similarity0.267
28
Image Copy DetectionNew Bing (test)
Average Similarity0.249
28
Image Copy DetectionSDXL (test)
Avg Similarity34
28
Image Copy DetectionGLIDE (test)
Average Similarity0.349
28
Image Copy DetectionSD 1.5 (test)
Average Similarity0.133
28
ICDiffD-Rep (test)
PCC30.5
20
Image Copy Detection (Descriptor)DISC 2021 (test)
μAP66.4
14
Copy detectionAnyPattern (test)
µAP14.51
8
Showing 10 of 10 rows

Other info

Follow for update