Invertible Zero-Shot Recognition Flows
About
Deep generative models have been successfully applied to Zero-Shot Learning (ZSL) recently. However, the underlying drawbacks of GANs and VAEs (e.g., the hardness of training with ZSL-oriented regularizers and the limited generation quality) hinder the existing generative ZSL models from fully bypassing the seen-unseen bias. To tackle the above limitations, for the first time, this work incorporates a new family of generative models (i.e., flow-based models) into ZSL. The proposed Invertible Zero-shot Flow (IZF) learns factorized data embeddings (i.e., the semantic factors and the non-semantic ones) with the forward pass of an invertible flow network, while the reverse pass generates data samples. This procedure theoretically extends conventional generative flows to a factorized conditional scheme. To explicitly solve the bias problem, our model enlarges the seen-unseen distributional discrepancy based on negative sample-based distance measurement. Notably, IZF works flexibly with either a naive Bayesian classifier or a held-out trainable one for zero-shot recognition. Experiments on widely-adopted ZSL benchmarks demonstrate the significant performance gain of IZF over existing methods, in both classic and generalized settings.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Generalized Zero-Shot Learning | CUB | H Score59.4 | 250 | |
| Generalized Zero-Shot Learning | SUN | H54.8 | 184 | |
| Generalized Zero-Shot Learning | AWA2 | S Score77.5 | 165 | |
| Image Classification | SUN | Harmonic Mean Top-1 Accuracy54.8 | 86 | |
| Generalized Zero-Shot Learning | AWA1 | S Score80.5 | 49 | |
| Classification | AWA2 (test) | MCA (unseen)60.6 | 22 | |
| Zero-shot Image Classification | APY (test) | Metric u42.3 | 21 | |
| Classification | CUB (test) | MCA_u52.7 | 17 | |
| Generalized Zero-Shot Learning | CUB 67 (test) | Unseen Accuracy0.527 | 5 |