Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Attributes as Operators: Factorizing Unseen Attribute-Object Compositions

About

We present a new approach to modeling visual attributes. Prior work casts attributes in a similar role as objects, learning a latent representation where properties (e.g., sliced) are recognized by classifiers much in the way objects (e.g., apple) are. However, this common approach fails to separate the attributes observed during training from the objects with which they are composed, making it ineffectual when encountering new attribute-object compositions. Instead, we propose to model attributes as operators. Our approach learns a semantic embedding that explicitly factors out attributes from their accompanying objects, and also benefits from novel regularizers expressing attribute operators' effects (e.g., blunt should undo the effects of sharp). Not only does our approach align conceptually with the linguistic role of attributes as modifiers, but it also generalizes to recognize unseen compositions of objects and attributes. We validate our approach on two challenging datasets and demonstrate significant improvements over the state-of-the-art. In addition, we show that not only can our model recognize unseen compositions robustly in an open-world setting, it can also generalize to compositions where objects themselves were unseen during training.

Tushar Nagarajan, Kristen Grauman• 2018

Related benchmarks

TaskDatasetResultRank
Generalized Compositional Zero-Shot LearningC-GQA (test)
AUC0.3
46
Compositional Zero-Shot LearningUT-Zappos open world
HM29.4
38
Compositional Zero-Shot LearningMIT-States open world
HM4.7
38
RetrievalCUB unseen attributes modified (novel)
mAP@500.322
15
Attribute ClassificationCUB unseen attributes novel modified
mAUROC67
15
LocalizationCUB unseen attributes modified (novel)
mLA49
15
Compositional Zero-Shot LearningVAW CZSL (test)
HM9.8
14
Generalized Compositional Zero-Shot LearningMIT-States (test)
AUC (1)1.6
12
Compositional Zero-Shot LearningMIT-States (test)
Top-1 Acc14.2
11
Generalized Compositional Zero-Shot LearningUT-Zap50K (test)
AUC25.9
10
Showing 10 of 23 rows

Other info

Code

Follow for update