Domain-Smoothing Network for Zero-Shot Sketch-Based Image Retrieval
About
Zero-Shot Sketch-Based Image Retrieval (ZS-SBIR) is a novel cross-modal retrieval task, where abstract sketches are used as queries to retrieve natural images under zero-shot scenario. Most existing methods regard ZS-SBIR as a traditional classification problem and employ a cross-entropy or triplet-based loss to achieve retrieval, which neglect the problems of the domain gap between sketches and natural images and the large intra-class diversity in sketches. Toward this end, we propose a novel Domain-Smoothing Network (DSN) for ZS-SBIR. Specifically, a cross-modal contrastive method is proposed to learn generalized representations to smooth the domain gap by mining relations with additional augmented samples. Furthermore, a category-specific memory bank with sketch features is explored to reduce intra-class diversity in the sketch domain. Extensive experiments demonstrate that our approach notably outperforms the state-of-the-art methods in both Sketchy and TU-Berlin datasets. Our source code is publicly available at https://github.com/haowang1992/DSN.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Sketch-based image retrieval | TU-Berlin Ext | mAP48.4 | 17 | |
| Sketch-based image retrieval | Sketchy Ext | mAP0.583 | 17 | |
| Zero-Shot Sketch-Based Image Retrieval | TU-Berlin to Sketchy 8 unseen classes Ext | mAP64.6 | 7 | |
| Zero-Shot Sketch-Based Image Retrieval | Sketchy -> TU-Berlin Ext (21 unseen classes) | mAP0.384 | 7 | |
| Zero-Shot Sketch-Based Image Retrieval | Sketchy -> QuickDraw Ext (11 unseen classes) | mAP15.2 | 7 | |
| Zero-Shot Sketch-Based Image Retrieval | TU-Berlin Ext -> QuickDraw Ext (10 unseen classes) | mAP0.229 | 7 |