Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

A Generative Adversarial Approach for Zero-Shot Learning from Noisy Texts

About

Most existing zero-shot learning methods consider the problem as a visual semantic embedding one. Given the demonstrated capability of Generative Adversarial Networks(GANs) to generate images, we instead leverage GANs to imagine unseen categories from text descriptions and hence recognize novel classes with no examples being seen. Specifically, we propose a simple yet effective generative model that takes as input noisy text descriptions about an unseen class (e.g.Wikipedia articles) and generates synthesized visual features for this class. With added pseudo data, zero-shot learning is naturally converted to a traditional classification problem. Additionally, to preserve the inter-class discrimination of the generated features, a visual pivot regularization is proposed as an explicit supervision. Unlike previous methods using complex engineered regularizers, our approach can suppress the noise well without additional regularization. Empirically, we show that our method consistently outperforms the state of the art on the largest available benchmarks on Text-based Zero-shot Learning.

Yizhe Zhu, Mohamed Elhoseiny, Bingchen Liu, Xi Peng, Ahmed Elgammal• 2017

Related benchmarks

TaskDatasetResultRank
Zero-shot LearningCUB
Top-1 Accuracy55.8
144
Image ClassificationCUB
Unseen Top-1 Acc35.2
89
Image ClassificationSUN
Harmonic Mean Top-1 Accuracy30.8
86
Zero-shot LearningSUN (unseen)
Top-1 Accuracy (%)62.8
50
Zero-shot LearningCUB (unseen)
Top-1 Accuracy66.4
49
Zero-shot LearningAWA2 (unseen)
Top-1 Acc78.6
37
Image ClassificationAWA2 GZSL
Acc (Unseen)32.4
32
Image ClassificationAWA1
Test Set Score (ts)29.6
30
Zero-shot recognitionAwA1 (test)
Top-1 Accuracy68.2
25
Zero-shot recognitionCUB pure ZSL setting (target classes)
Top-1 Accuracy55.8
18
Showing 10 of 21 rows

Other info

Follow for update