Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Unsupervised Hashing with Similarity Distribution Calibration

About

Unsupervised hashing methods typically aim to preserve the similarity between data points in a feature space by mapping them to binary hash codes. However, these methods often overlook the fact that the similarity between data points in the continuous feature space may not be preserved in the discrete hash code space, due to the limited similarity range of hash codes. The similarity range is bounded by the code length and can lead to a problem known as similarity collapse. That is, the positive and negative pairs of data points become less distinguishable from each other in the hash space. To alleviate this problem, in this paper a novel Similarity Distribution Calibration (SDC) method is introduced. SDC aligns the hash code similarity distribution towards a calibration distribution (e.g., beta distribution) with sufficient spread across the entire similarity range, thus alleviating the similarity collapse problem. Extensive experiments show that our SDC outperforms significantly the state-of-the-art alternatives on coarse category-level and instance-level image retrieval. Code is available at https://github.com/kamwoh/sdc.

Kam Woh Ng, Xiatian Zhu, Jiun Tian Hoe, Chee Seng Chan, Tianyu Zhang, Yi-Zhe Song, Tao Xiang• 2023

Related benchmarks

TaskDatasetResultRank
Image RetrievalOxford Flowers
mAP50.24
99
Image RetrievalNUS-WIDE
mAP84.2
57
Image-to-Image RetrievalFood101--
55
Fine-grained Image HashingCUB200-2011
Collision Probability0.067
30
Fine-grained Image HashingCUB200 2011 (test)
Collision Probability7.40e-4
30
Fine-grained Image HashingStanford Dogs
Collision Probability5.1
30
Fine-grained Image HashingStanford Dogs (test)
Collision Probability0.055
30
Fine-grained Image HashingNUS-WIDE (test)
Collision Probability (%)0.24
18
Showing 8 of 8 rows

Other info

Follow for update