Potential Field Based Deep Metric Learning
About
Deep metric learning (DML) involves training a network to learn a semantically meaningful representation space. Many current approaches mine n-tuples of examples and model interactions within each tuplets. We present a novel, compositional DML model that instead of in tuples, represents the influence of each example (embedding) by a continuous potential field, and superposes the fields to obtain their combined global potential field. We use attractive/repulsive potential fields to represent interactions among embeddings from images of the same/different classes. Contrary to typical learning methods, where mutual influence of samples is proportional to their distance, we enforce reduction in such influence with distance, leading to a decaying field. We show that such decay helps improve performance on real world datasets with large intra-class variations and label noise. Like other proxy-based methods, we also use proxies to succinctly represent sub-populations of examples. We evaluate our method on three standard DML benchmarks- Cars-196, CUB-200-2011, and SOP datasets where it outperforms state-of-the-art baselines.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Retrieval | CUB-200-2011 (test) | -- | 251 | |
| Image Retrieval | CUB-200 2011 | Recall@187.8 | 146 | |
| Image Retrieval | CARS 196 | Recall@194.7 | 98 | |
| Image Retrieval | SOP | Recall@188.2 | 32 | |
| Image Retrieval | CUB-200 Ext. Real to Watercolor 2011 | R@125 | 18 | |
| Image Retrieval | CUB-200-2011 Ext. Real to Oil-painting | R@120.39 | 18 | |
| Image Retrieval | Cars196 Real to Oil-painting Ext. | R@117.34 | 18 | |
| Image Retrieval | Cars196 Ext. Real to Watercolor | R@116.19 | 18 | |
| Cross-Domain Image Retrieval | DomainNet Real to Infograph | R@132.77 | 16 | |
| Cross-Domain Image Retrieval | DomainNet Real to Painting | R@164.83 | 16 |