Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Potential Field Based Deep Metric Learning

About

Deep metric learning (DML) involves training a network to learn a semantically meaningful representation space. Many current approaches mine n-tuples of examples and model interactions within each tuplets. We present a novel, compositional DML model that instead of in tuples, represents the influence of each example (embedding) by a continuous potential field, and superposes the fields to obtain their combined global potential field. We use attractive/repulsive potential fields to represent interactions among embeddings from images of the same/different classes. Contrary to typical learning methods, where mutual influence of samples is proportional to their distance, we enforce reduction in such influence with distance, leading to a decaying field. We show that such decay helps improve performance on real world datasets with large intra-class variations and label noise. Like other proxy-based methods, we also use proxies to succinctly represent sub-populations of examples. We evaluate our method on three standard DML benchmarks- Cars-196, CUB-200-2011, and SOP datasets where it outperforms state-of-the-art baselines.

Shubhang Bhatnagar, Narendra Ahuja• 2024

Related benchmarks

TaskDatasetResultRank
Image RetrievalCUB-200-2011 (test)--
251
Image RetrievalCUB-200 2011
Recall@187.8
146
Image RetrievalCARS 196
Recall@194.7
98
Image RetrievalSOP
Recall@188.2
32
Image RetrievalCUB-200 Ext. Real to Watercolor 2011
R@125
18
Image RetrievalCUB-200-2011 Ext. Real to Oil-painting
R@120.39
18
Image RetrievalCars196 Real to Oil-painting Ext.
R@117.34
18
Image RetrievalCars196 Ext. Real to Watercolor
R@116.19
18
Cross-Domain Image RetrievalDomainNet Real to Infograph
R@132.77
16
Cross-Domain Image RetrievalDomainNet Real to Painting
R@164.83
16
Showing 10 of 14 rows

Other info

Follow for update