Contrastive Quantization with Code Memory for Unsupervised Image Retrieval
About
The high efficiency in computation and storage makes hashing (including binary hashing and quantization) a common strategy in large-scale retrieval systems. To alleviate the reliance on expensive annotations, unsupervised deep hashing becomes an important research problem. This paper provides a novel solution to unsupervised deep quantization, namely Contrastive Quantization with Code Memory (MeCoQ). Different from existing reconstruction-based strategies, we learn unsupervised binary descriptors by contrastive learning, which can better capture discriminative visual semantics. Besides, we uncover that codeword diversity regularization is critical to prevent contrastive learning-based quantization from model degeneration. Moreover, we introduce a novel quantization code memory module that boosts contrastive learning with lower feature drift than conventional feature memories. Extensive experiments on benchmark datasets show that MeCoQ outperforms state-of-the-art methods. Code and configurations are publicly available at https://github.com/gimpong/AAAI22-MeCoQ.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Retrieval | Oxford Flowers | mAP39.58 | 99 | |
| Image Retrieval | NUS-WIDE | mAP83.2 | 57 | |
| Image-to-Image Retrieval | Food101 | -- | 55 | |
| Fine-grained Image Hashing | Stanford Dogs | Collision Probability4.4 | 30 | |
| Fine-grained Image Hashing | Stanford Dogs (test) | Collision Probability0.045 | 30 | |
| Fine-grained Image Hashing | CUB200 2011 (test) | Collision Probability8.10e-4 | 30 | |
| Fine-grained Image Hashing | CUB200-2011 | Collision Probability0.08 | 30 | |
| Fine-grained Image Hashing | NUS-WIDE (test) | Collision Probability (%)0.338 | 18 |