
Learning Graph Embeddings for Compositional Zero-shot Learning

About

In compositional zero-shot learning, the goal is to recognize unseen compositions (e.g. old dog) of visual primitives, namely states (e.g. old, cute) and objects (e.g. car, dog), that were observed in the training set. This is challenging because the same state can, for example, alter the visual appearance of a dog very differently from that of a car. As a solution, we propose a novel graph formulation called Compositional Graph Embedding (CGE) that learns image features, compositional classifiers, and latent representations of visual primitives in an end-to-end manner. The key to our approach is exploiting the dependency between states, objects, and their compositions within a graph structure to enforce the relevant knowledge transfer from seen to unseen compositions. By learning a joint compatibility that encodes semantics between concepts, our model generalizes to unseen compositions without relying on an external knowledge base like WordNet. We show that in the challenging generalized compositional zero-shot setting our CGE significantly outperforms the state of the art on MIT-States and UT-Zappos. We also propose a new benchmark for this task based on the recent GQA dataset. Code is available at: https://github.com/ExplainableML/czsl
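To make the graph structure described above more concrete, here is a minimal sketch of how such a graph could be assembled. It assumes one node per state, per object, and per composition, with symmetric edges linking each composition to its state and its object (and the state to the object), plus self-loops. The function name `build_composition_graph`, the tuple-based node keys, and the toy vocabulary are illustrative assumptions, not taken from the released code; the actual model additionally runs a graph convolutional network over node embeddings, which is omitted here.

```python
def build_composition_graph(states, objects, compositions):
    """Return (nodes, adjacency) for a toy state/object/composition graph.

    Assumption: edges follow the dependency structure described in the
    abstract; this is an illustrative sketch, not the authors' code.
    """
    # Tuple keys keep state, object, and composition names from colliding.
    nodes = ([("state", s) for s in states]
             + [("object", o) for o in objects]
             + [("composition", s, o) for s, o in compositions])
    index = {node: i for i, node in enumerate(nodes)}
    n = len(nodes)

    # Start from the identity matrix, i.e. a self-loop on every node.
    adjacency = [[1 if i == j else 0 for j in range(n)] for i in range(n)]

    def connect(a, b):
        # Undirected edge: set both directions.
        adjacency[index[a]][index[b]] = 1
        adjacency[index[b]][index[a]] = 1

    for s, o in compositions:
        state, obj, comp = ("state", s), ("object", o), ("composition", s, o)
        connect(state, obj)
        connect(state, comp)
        connect(obj, comp)
    return nodes, adjacency


# Toy vocabulary (hypothetical). "old dog" stands in for an unseen
# composition that still receives a node in the graph.
states = ["old", "cute"]
objects = ["car", "dog"]
compositions = [("old", "car"), ("cute", "dog"), ("old", "dog")]
nodes, adjacency = build_composition_graph(states, objects, compositions)
```

In this toy graph the unseen composition "old dog" is connected to the primitives "old" and "dog", both of which also appear in seen compositions. That shared neighborhood is what lets information propagate from seen to unseen compositions.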

Muhammad Ferjad Naeem, Yongqin Xian, Federico Tombari, Zeynep Akata • 2021

Related benchmarks

Task | Dataset | Metric | Result | Rank
Generalized Compositional Zero-Shot Learning | C-GQA (test) | AUC | 3.7 | 46
Compositional Zero-Shot Learning | UT-Zappos Closed World | HM | 60.5 | 42
Compositional Zero-Shot Learning | C-GQA Closed World | HM | 15.5 | 41
Compositional Zero-Shot Learning | UT-Zappos open world | HM | 39 | 38
Compositional Zero-Shot Learning | MIT-States open world | HM | 6 | 38
Compositional Zero-Shot Learning | C-GQA open world | HM Score | 2.9 | 35
Compositional Zero-Shot Learning | MIT-States Closed World | Harmonic Mean (HM) | 0.214 | 32
Retrieval | CUB unseen attributes modified (novel) | mAP@50 | 0.332 | 15
Attribute Classification | CUB unseen attributes novel modified | mAUROC | 67.1 | 15
Localization | CUB unseen attributes modified (novel) | mLA | 44.4 | 15
Showing 10 of 26 rows
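Several rows above report a harmonic mean (HM). In generalized zero-shot evaluation this is commonly computed as the harmonic mean of accuracy on seen and on unseen compositions, which rewards a model only when it performs well on both. The sketch below assumes that definition; the function name and the input numbers are hypothetical and are not results from the paper. Note that the reported values appear to mix scales (e.g. 60.5 versus 0.214), so they may not be directly comparable across rows.

```python
def harmonic_mean(seen_acc, unseen_acc):
    """Harmonic mean of seen and unseen accuracy (same scale for both)."""
    total = seen_acc + unseen_acc
    if total == 0:
        # Avoid division by zero when both accuracies are zero.
        return 0.0
    return 2 * seen_acc * unseen_acc / total


# Hypothetical accuracies, chosen only to illustrate the formula.
hm_example = harmonic_mean(0.5, 0.25)
```

With these inputs the harmonic mean is one third, noticeably below the arithmetic mean of 0.375, because the weaker of the two accuracies dominates.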
