Classification is a Strong Baseline for Deep Metric Learning
About
Deep metric learning aims to learn a function mapping image pixels to embedding feature vectors that model the similarity between images. Two major applications of metric learning are content-based image retrieval and face verification. For the retrieval tasks, the majority of current state-of-the-art (SOTA) approaches are triplet-based non-parametric training. For the face verification tasks, however, recent SOTA approaches have adopted classification-based parametric training. In this paper, we look into the effectiveness of classification based approaches on image retrieval datasets. We evaluate on several standard retrieval datasets such as CAR-196, CUB-200-2011, Stanford Online Product, and In-Shop datasets for image retrieval and clustering, and establish that our classification-based approach is competitive across different feature dimensions and base feature networks. We further provide insights into the performance effects of subsampling classes for scalable classification-based training, and the effects of binarization, enabling efficient storage and computation for practical applications.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Retrieval | CUB-200-2011 (test) | Recall@165.3 | 251 | |
| Image Retrieval | Stanford Online Products (test) | Recall@179.5 | 220 | |
| Image Retrieval | CUB-200 2011 | Recall@161.3 | 146 | |
| Image Retrieval | CARS196 (test) | Recall@189.3 | 134 | |
| Deep Metric Learning | CUB200 2011 (test) | Recall@161.3 | 129 | |
| Image Retrieval | In-shop Clothes Retrieval Dataset | Recall@189.4 | 120 | |
| Image Retrieval | CARS 196 | Recall@184.2 | 98 | |
| Image Retrieval | CUB | Recall@161.3 | 87 | |
| In-shop clothes retrieval | in-shop clothes retrieval dataset (test) | Recall@189.4 | 78 | |
| Deep Metric Learning | CARS196 | Recall@184.2 | 50 |