Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Visual-Augmented Dynamic Semantic Prototype for Generative Zero-Shot Learning

About

Generative Zero-shot learning (ZSL) learns a generator to synthesize visual samples for unseen classes, which is an effective way to advance ZSL. However, existing generative methods rely on the conditions of Gaussian noise and the predefined semantic prototype, which limit the generator only optimized on specific seen classes rather than characterizing each visual instance, resulting in poor generalizations (\textit{e.g.}, overfitting to seen classes). To address this issue, we propose a novel Visual-Augmented Dynamic Semantic prototype method (termed VADS) to boost the generator to learn accurate semantic-visual mapping by fully exploiting the visual-augmented knowledge into semantic conditions. In detail, VADS consists of two modules: (1) Visual-aware Domain Knowledge Learning module (VDKL) learns the local bias and global prior of the visual features (referred to as domain visual knowledge), which replace pure Gaussian noise to provide richer prior noise information; (2) Vision-Oriented Semantic Updation module (VOSU) updates the semantic prototype according to the visual representations of the samples. Ultimately, we concatenate their output as a dynamic semantic prototype, which serves as the condition of the generator. Extensive experiments demonstrate that our VADS achieves superior CZSL and GZSL performances on three prominent datasets and outperforms other state-of-the-art methods with averaging increases by 6.4\%, 5.9\% and 4.2\% on SUN, CUB and AWA2, respectively.

Wenjin Hou, Shiming Chen, Shuhuang Chen, Ziming Hong, Yan Wang, Xuetao Feng, Salman Khan, Fahad Shahbaz Khan, Xinge You• 2024

Related benchmarks

TaskDatasetResultRank
Generalized Zero-Shot LearningCUB
H Score74.3
250
Generalized Zero-Shot LearningSUN
H55.7
184
Generalized Zero-Shot LearningAWA2
S Score83.6
165
Zero-shot LearningCUB
Top-1 Accuracy86.8
144
Zero-shot LearningSUN
Top-1 Accuracy76.3
114
Zero-shot LearningAWA2
Top-1 Accuracy0.825
95
Image ClassificationCUB
Unseen Top-1 Acc74.1
89
Image ClassificationAWA2 GZSL
Acc (Unseen)75.4
32
Image ClassificationSUN GZSL
Top-1 Acc (Unseen)64.6
14
Image ClassificationAWA2 ZSL
Top-1 Acc82.5
11
Showing 10 of 11 rows

Other info

Follow for update