Learning Fine-grained Image Similarity with Deep Ranking
About
Learning fine-grained image similarity is a challenging task. It needs to capture between-class and within-class image differences. This paper proposes a deep ranking model that employs deep learning techniques to learn similarity metric directly from images.It has higher learning capability than models based on hand-crafted features. A novel multiscale network structure has been developed to describe the images effectively. An efficient triplet sampling algorithm is proposed to learn the model with distributed asynchronized stochastic gradient. Extensive experiments show that the proposed algorithm outperforms models based on hand-crafted visual features and deep classification models.
Jiang Wang, Yang song, Thomas Leung, Chuck Rosenberg, Jinbin Wang, James Philbin, Bo Chen, Ying Wu• 2014
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Visual Similarity | Street2Shop | Training Speed (min/epoch)165 | 8 | |
| Image Retrieval | Street2Shop (test) | Max Test Accuracy90.8 | 8 | |
| Image Retrieval | Street2Shop (train) | Max Training Accuracy87.72 | 8 | |
| Image Semantic Embedding | PIT (internal evaluation) | Triplet Accuracy74.95 | 5 | |
| Image Semantic Embedding | GIT (internal evaluation) | Triplet Accuracy77.25 | 5 | |
| kNN search accuracy | ImageNet | Top-1 Accuracy35.2 | 5 | |
| kNN search accuracy | iNaturalist | Top-1 Accuracy6.03 | 5 | |
| Image Retrieval | Exact Street2Shop (val) | Accuracy93.39 | 3 | |
| Image Retrieval | Exact Street2Shop (test) | Top-20 Recall87.914 | 3 |
Showing 9 of 9 rows