ProxyNCA++: Revisiting and Revitalizing Proxy Neighborhood Component Analysis
About
We consider the problem of distance metric learning (DML), where the task is to learn an effective similarity measure between images. We revisit ProxyNCA and incorporate several enhancements. We find that low temperature scaling is a performance-critical component and explain why it works. Besides, we also discover that Global Max Pooling works better in general when compared to Global Average Pooling. Additionally, our proposed fast moving proxies also addresses small gradient issue of proxies, and this component synergizes well with low temperature scaling and Global Max Pooling. Our enhanced model, called ProxyNCA++, achieves a 22.9 percentage point average improvement of Recall@1 across four different zero-shot retrieval datasets compared to the original ProxyNCA algorithm. Furthermore, we achieve state-of-the-art results on the CUB200, Cars196, Sop, and InShop datasets, achieving Recall@1 scores of 72.2, 90.1, 81.4, and 90.9, respectively.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Retrieval | CUB-200-2011 (test) | Recall@172.2 | 251 | |
| Image Retrieval | Stanford Online Products (test) | Recall@181.4 | 220 | |
| Image Retrieval | CUB-200 2011 | Recall@164.7 | 146 | |
| Deep Metric Learning | CUB200 2011 (test) | Recall@169.1 | 129 | |
| Image Retrieval | In-shop Clothes Retrieval Dataset | Recall@190.9 | 120 | |
| Image Retrieval | CARS 196 | Recall@185.1 | 98 | |
| Image Retrieval | CUB | Recall@169 | 87 | |
| In-shop clothes retrieval | in-shop clothes retrieval dataset (test) | Recall@190.4 | 78 | |
| Deep Metric Learning | CARS196 (test) | R@186.5 | 56 | |
| Image Retrieval | Cars | R@186.5 | 44 |