A Generative Adversarial Approach for Zero-Shot Learning from Noisy Texts

About

Most existing zero-shot learning methods consider the problem as a visual semantic embedding one. Given the demonstrated capability of Generative Adversarial Networks (GANs) to generate images, we instead leverage GANs to imagine unseen categories from text descriptions and hence recognize novel classes with no examples being seen. Specifically, we propose a simple yet effective generative model that takes as input noisy text descriptions about an unseen class (e.g., Wikipedia articles) and generates synthesized visual features for this class. With added pseudo data, zero-shot learning is naturally converted to a traditional classification problem. Additionally, to preserve the inter-class discrimination of the generated features, a visual pivot regularization is proposed as an explicit supervision. Unlike previous methods using complex engineered regularizers, our approach can suppress the noise well without additional regularization. Empirically, we show that our method consistently outperforms the state of the art on the largest available benchmarks on Text-based Zero-shot Learning.
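The pipeline described in the abstract (synthesize visual features from a class's text, regularize them toward class-level "pivots", then classify with the pseudo data) can be illustrated with a minimal NumPy sketch. Everything here is an assumption made for illustration, not the authors' implementation: the generator is a fixed random linear map standing in for a trained GAN generator, the visual pivot term is assumed to be the squared distance between per-class means of generated and real features, and the final classifier is a plain nearest-neighbor lookup. The names `generate_features`, `visual_pivot_loss`, and `classify_nearest` are hypothetical.

```python
import numpy as np


def generate_features(text_emb, weights, n_samples, noise_dim, rng):
    """Toy stand-in for a trained generator (assumption, not the paper's network).

    Concatenates one class-level text embedding with fresh Gaussian noise
    and maps the result to a visual feature vector.
    """
    noise = rng.normal(size=(n_samples, noise_dim))
    text = np.repeat(text_emb[None, :], n_samples, axis=0)
    return np.tanh(np.concatenate([text, noise], axis=1) @ weights)


def visual_pivot_loss(fake_feats, fake_labels, real_feats, real_labels):
    """Assumed form of a visual pivot term: average over classes of the
    squared distance between the mean generated and mean real feature."""
    classes = np.unique(real_labels)
    total = 0.0
    for c in classes:
        real_pivot = real_feats[real_labels == c].mean(axis=0)
        fake_pivot = fake_feats[fake_labels == c].mean(axis=0)
        total += np.sum((fake_pivot - real_pivot) ** 2)
    return total / len(classes)


def classify_nearest(query_feats, synth_feats, synth_labels):
    """Label each query with the class of its nearest synthesized feature,
    turning zero-shot recognition into ordinary classification."""
    diffs = query_feats[:, None, :] - synth_feats[None, :, :]
    dists = (diffs ** 2).sum(axis=2)
    return synth_labels[dists.argmin(axis=1)]
```

In this sketch, features for an unseen class would be produced by calling `generate_features` on that class's text embedding, and test images would then be labeled with `classify_nearest`. During training, a term like `visual_pivot_loss` would be added to the generator's adversarial loss on seen classes.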

Yizhe Zhu, Mohamed Elhoseiny, Bingchen Liu, Xi Peng, Ahmed Elgammal • 2017

Related benchmarks

Task	Dataset	Metric	Result
Zero-shot Learning	CUB	Top-1 Accuracy	55.8	183
Image Classification	CUB	Harmonic Mean Top-1 Acc	43.1	106
Image Classification	SUN	Harmonic Mean Top-1 Accuracy	30.8	86
Zero-shot Learning	SUN (unseen)	Top-1 Accuracy (%)	62.8	50
Zero-shot Learning	CUB (unseen)	Top-1 Accuracy	66.4	49
Image Classification	AWA2 GZSL	H (Harmonic Mean)	47.4	49
Zero-shot Learning	AWA2 (unseen)	Top-1 Acc	78.6	37
Image Classification	AWA1	Test Set Score (ts)	29.6	30
Zero-shot recognition	AwA1 (test)	Top-1 Accuracy	68.2	25
Zero-shot recognition	CUB pure ZSL setting (target classes)	Top-1 Accuracy	55.8	18

Showing 10 of 21 rows
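Several rows above report a harmonic mean (H). In generalized zero-shot learning this is conventionally the harmonic mean of the accuracy on seen classes and the accuracy on unseen classes, so a model scores well only if it does well on both. A minimal sketch of that computation follows. The function name `harmonic_mean` and the input values in the usage line are hypothetical and are not taken from the table.

```python
def harmonic_mean(seen_acc, unseen_acc):
    """Harmonic mean of seen-class and unseen-class accuracy.

    Returns 0.0 when both accuracies are zero to avoid dividing by zero.
    """
    total = seen_acc + unseen_acc
    if total == 0:
        return 0.0
    return 2.0 * seen_acc * unseen_acc / total


# Hypothetical accuracies, for illustration only.
h = harmonic_mean(60.0, 30.0)
```

Because the harmonic mean is pulled toward the smaller of its two inputs, a large gap between seen and unseen accuracy yields a low H even when one of them is high.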
