Latent Embeddings for Zero-shot Classification
About
We present a novel latent embedding model for learning a compatibility function between image and class embeddings, in the context of zero-shot classification. The proposed method augments the state-of-the-art bilinear compatibility model by incorporating latent variables. Instead of learning a single bilinear map, it learns a collection of maps with the selection, of which map to use, being a latent variable for the current image-class pair. We train the model with a ranking based objective function which penalizes incorrect rankings of the true class for a given image. We empirically demonstrate that our model improves the state-of-the-art for various class embeddings consistently on three challenging publicly available datasets for the zero-shot setting. Moreover, our method leads to visually highly interpretable results with clear clusters of different fine-grained object properties that correspond to different latent variable maps.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Generalized Zero-Shot Learning | CUB | H Score24 | 250 | |
| Generalized Zero-Shot Learning | SUN | H19.5 | 184 | |
| Generalized Zero-Shot Learning | AWA2 | S Score77.3 | 165 | |
| Zero-shot Learning | CUB | Top-1 Accuracy49.3 | 144 | |
| Zero-shot Learning | SUN | Top-1 Accuracy55.3 | 114 | |
| Zero-shot Learning | AWA2 | Top-1 Accuracy0.558 | 95 | |
| Image Classification | CUB | Unseen Top-1 Acc15.2 | 89 | |
| Image Classification | SUN | Harmonic Mean Top-1 Accuracy19.5 | 86 | |
| Zero-shot Learning | SUN (unseen) | Top-1 Accuracy (%)55.3 | 50 | |
| Generalized Zero-Shot Learning | AWA1 | S Score71.7 | 49 |