Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MIC: Mining Interclass Characteristics for Improved Metric Learning

About

Metric learning seeks to embed images of objects suchthat class-defined relations are captured by the embeddingspace. However, variability in images is not just due to different depicted object classes, but also depends on other latent characteristics such as viewpoint or illumination. In addition to these structured properties, random noise further obstructs the visual relations of interest. The common approach to metric learning is to enforce a representation that is invariant under all factors but the ones of interest. In contrast, we propose to explicitly learn the latent characteristics that are shared by and go across object classes. We can then directly explain away structured visual variability, rather than assuming it to be unknown random noise. We propose a novel surrogate task to learn visual characteristics shared across classes with a separate encoder. This encoder is trained jointly with the encoder for class information by reducing their mutual information. On five standard image retrieval benchmarks the approach significantly improves upon the state-of-the-art.

Karsten Roth, Biagio Brattoli, Bj\"orn Ommer• 2019

Related benchmarks

TaskDatasetResultRank
Image RetrievalCUB-200-2011 (test)
Recall@166.1
251
Image RetrievalStanford Online Products (test)
Recall@177.2
220
Image RetrievalCARS196 (test)
Recall@182.6
134
Deep Metric LearningCUB200 2011 (test)
Recall@166.1
129
Image RetrievalIn-shop Clothes Retrieval Dataset
Recall@188.2
120
Image RetrievalCUB
Recall@166.1
87
In-shop clothes retrievalin-shop clothes retrieval dataset (test)
Recall@188.2
78
Vehicle Re-identificationVehicleID (Small)
R-186.9
61
Image RetrievalCARS 196 (test)
Recall@182.6
56
Deep Metric LearningCARS196 (test)
R@182.6
56
Showing 10 of 23 rows

Other info

Code

Follow for update