Feature Generating Networks for Zero-Shot Learning
About
Suffering from the extreme training data imbalance between seen and unseen classes, most of existing state-of-the-art approaches fail to achieve satisfactory results for the challenging generalized zero-shot learning task. To circumvent the need for labeled examples of unseen classes, we propose a novel generative adversarial network (GAN) that synthesizes CNN features conditioned on class-level semantic information, offering a shortcut directly from a semantic descriptor of a class to a class-conditional feature distribution. Our proposed approach, pairing a Wasserstein GAN with a classification loss, is able to generate sufficiently discriminative CNN features to train softmax classifiers or any multimodal embedding method. Our experimental results demonstrate a significant boost in accuracy over the state of the art on five challenging datasets -- CUB, FLO, SUN, AWA and ImageNet -- in both the zero-shot learning and generalized zero-shot learning settings.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Action Recognition | UCF101 | Accuracy37.5 | 365 | |
| Action Recognition | UCF101 (test) | Accuracy44.4 | 307 | |
| Generalized Zero-Shot Learning | CUB | H Score79.7 | 250 | |
| Action Recognition | HMDB51 (test) | Accuracy32.7 | 249 | |
| Few-shot Image Classification | Mini-Imagenet (test) | -- | 235 | |
| Generalized Zero-Shot Learning | SUN | H59.5 | 184 | |
| Generalized Zero-Shot Learning | AWA2 | S Score71 | 165 | |
| Zero-shot Learning | CUB | Top-1 Accuracy84.5 | 144 | |
| Zero-shot Learning | SUN | Top-1 Accuracy75.5 | 114 | |
| Zero-shot Learning | AWA2 | Top-1 Accuracy0.759 | 95 |