Classification is a Strong Baseline for Deep Metric Learning

About

Deep metric learning aims to learn a function mapping image pixels to embedding feature vectors that model the similarity between images. Two major applications of metric learning are content-based image retrieval and face verification. For the retrieval tasks, the majority of current state-of-the-art (SOTA) approaches are triplet-based non-parametric training. For the face verification tasks, however, recent SOTA approaches have adopted classification-based parametric training. In this paper, we look into the effectiveness of classification based approaches on image retrieval datasets. We evaluate on several standard retrieval datasets such as CAR-196, CUB-200-2011, Stanford Online Product, and In-Shop datasets for image retrieval and clustering, and establish that our classification-based approach is competitive across different feature dimensions and base feature networks. We further provide insights into the performance effects of subsampling classes for scalable classification-based training, and the effects of binarization, enabling efficient storage and computation for practical applications.

Andrew Zhai, Hao-Yu Wu• 2018

Related benchmarks

Task	Dataset	Result
Image Retrieval	CUB-200-2011 (test)	Recall@165.3	251
Image Retrieval	Stanford Online Products (test)	Recall@179.5	231
Image Retrieval	CUB-200 2011	Recall@161.3	163
Image Retrieval	CARS196 (test)	Recall@189.3	134
Deep Metric Learning	CUB200 2011 (test)	Recall@161.3	129
Image Retrieval	In-shop Clothes Retrieval Dataset	Recall@189.4	120
Image Retrieval	CARS 196	Recall@184.2	98
Image Retrieval	CUB	Recall@161.3	87
In-shop clothes retrieval	in-shop clothes retrieval dataset (test)	Recall@189.4	78
Deep Metric Learning	CARS196	Recall@184.2	50

Showing 10 of 32 rows

Other info

Code

Follow for update

@wizwand_team Discord