Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Zero-shot Recognition via Semantic Embeddings and Knowledge Graphs

About

We consider the problem of zero-shot recognition: learning a visual classifier for a category with zero training examples, just using the word embedding of the category and its relationship to other categories, which visual data are provided. The key to dealing with the unfamiliar or novel category is to transfer knowledge obtained from familiar classes to describe the unfamiliar class. In this paper, we build upon the recently introduced Graph Convolutional Network (GCN) and propose an approach that uses both semantic embeddings and the categorical relationships to predict the classifiers. Given a learned knowledge graph (KG), our approach takes as input semantic embeddings for each node (representing visual category). After a series of graph convolutions, we predict the visual classifier for each category. During training, the visual classifiers for a few categories are given to learn the GCN parameters. At test time, these filters are used to predict the visual classifiers of unseen categories. We show that our approach is robust to noise in the KG. More importantly, our approach provides significant improvement in performance compared to the current state-of-the-art results (from 2 ~ 3% on some metrics to whopping 20% on a few).

Xiaolong Wang, Yufei Ye, Abhinav Gupta• 2018

Related benchmarks

TaskDatasetResultRank
Zero-shot LearningAWA2
Top-1 Accuracy0.77
95
Zero-Shot Object ClassificationaPY
U Score49.1
16
Image ClassificationImageNet 2-hop split
Flat Hit@119.8
15
Image ClassificationImageNet 3-hop split
Flat Hit@14.1
15
Object ClassificationImageNet All
Top-1 Accuracy1.8
8
Intent ClassificationSNIPS-NLU (test)
Accuracy82.47
7
Zero-shot Image ClassificationImageNet 2-hop (Unseen categories only)
Top-1 Acc20.9
5
Zero-shot Image ClassificationImageNet 3-hop (Unseen categories only)
Top-1 Accuracy4.3
5
Zero-shot Image ClassificationImageNet 2-hop (Unseen & seen categories)
Top-1 Accuracy10
4
Zero-shot Image ClassificationImageNet 3-hop Unseen & seen categories
Top-1 Accuracy2.4
4
Showing 10 of 10 rows

Other info

Follow for update