Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Image Hashing via Cross-View Code Alignment in the Age of Foundation Models

About

Efficient large-scale retrieval requires representations that are both compact and discriminative. Foundation models provide powerful visual and multimodal embeddings, but nearest neighbor search in these high-dimensional spaces is computationally expensive. Hashing offers an efficient alternative by enabling fast Hamming distance search with binary codes, yet existing approaches often rely on complex pipelines, multi-term objectives, designs specialized for a single learning paradigm, and long training times. We introduce CroVCA (Cross-View Code Alignment), a simple and unified principle for learning binary codes that remain consistent across semantically aligned views. A single binary cross-entropy loss enforces alignment, while coding-rate maximization serves as an anti-collapse regularizer to promote balanced and diverse codes. To implement this, we design HashCoder, a lightweight MLP hashing network with a final batch normalization layer to enforce balanced codes. HashCoder can be used as a probing head on frozen embeddings or to adapt encoders efficiently via LoRA fine-tuning. Across benchmarks, CroVCA achieves state-of-the-art results in just 5 training epochs. At 16 bits, it performs particularly well; for instance, unsupervised hashing on COCO completes in under 2 minutes and supervised hashing on ImageNet100 in about 3 minutes on a single GPU. These results highlight CroVCA's efficiency, adaptability, and broad applicability.

Ilyass Moummad, Kawtar Zaher, Herv\'e Go\"eau, Alexis Joly• 2025

Related benchmarks

TaskDatasetResultRank
Image RetrievalNUS-WIDE
P@100080.2
50
Unsupervised Image HashingCOCO
mAP90.1
40
Unsupervised Image HashingNUS-WIDE
mAP84.3
40
Unsupervised Image HashingCIFAR10
mAP98.7
34
Unsupervised Image HashingFLICKR25K
mAP83.3
34
Unsupervised Image HashingImageNet-100
mAP94.3
31
Image RetrievalImageNet100
mAP93.5
30
Image RetrievalCIFAR10
mAP97.4
30
Image RetrievalFLICKR25K
mAP81.1
30
Image RetrievalCOCO--
19
Showing 10 of 12 rows

Other info

Follow for update