Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Class Normalization for (Continual)? Generalized Zero-Shot Learning

About

Normalization techniques have proved to be a crucial ingredient of successful training in a traditional supervised learning regime. However, in the zero-shot learning (ZSL) world, these ideas have received only marginal attention. This work studies normalization in ZSL scenario from both theoretical and practical perspectives. First, we give a theoretical explanation to two popular tricks used in zero-shot learning: normalize+scale and attributes normalization and show that they help training by preserving variance during a forward pass. Next, we demonstrate that they are insufficient to normalize a deep ZSL model and propose Class Normalization (CN): a normalization scheme, which alleviates this issue both provably and in practice. Third, we show that ZSL models typically have more irregular loss surface compared to traditional classifiers and that the proposed method partially remedies this problem. Then, we test our approach on 4 standard ZSL datasets and outperform sophisticated modern SotA with a simple MLP optimized without any bells and whistles and having ~50 times faster training speed. Finally, we generalize ZSL to a broader problem -- continual ZSL, and introduce some principled metrics and rigorous baselines for this new setup. The project page is located at https://universome.github.io/class-norm.

Ivan Skorokhodov, Mohamed Elhoseiny• 2020

Related benchmarks

TaskDatasetResultRank
Generalized Zero-Shot LearningCUB
H Score54.9
250
Generalized Zero-Shot LearningSUN--
184
Generalized Zero-Shot LearningAWA2
S Score84.25
165
Continual Generalized Zero-Shot LearningAWA2
Mean Seen Accuracy (mSA)89.22
24
Continual Generalized Zero-Shot LearningCUB
Mean Accuracy (mSA)64.91
24
Continual Generalized Zero-Shot LearningSUN
mSA50.56
23
Continual Generalized Zero-Shot LearningaPY
Seen Accuracy (mSA)79.6
22
Continual Generalized Zero-Shot LearningAWA1
mSA (Seen)75.59
22
Dynamic Continual Generalized Zero-Shot LearningCUB (test)
Forgetting8
10
Dynamic Continual Generalized Zero-Shot LearningAWA2 (test)
Forgetting Measure10
10
Showing 10 of 10 rows

Other info

Follow for update