Multi-Label Image Recognition with Graph Convolutional Networks
About
The task of multi-label image recognition is to predict a set of object labels that present in an image. As objects normally co-occur in an image, it is desirable to model the label dependencies to improve the recognition performance. To capture and explore such important dependencies, we propose a multi-label classification model based on Graph Convolutional Network (GCN). The model builds a directed graph over the object labels, where each node (label) is represented by word embeddings of a label, and GCN is learned to map this label graph into a set of inter-dependent object classifiers. These classifiers are applied to the image descriptors extracted by another sub-net, enabling the whole network to be end-to-end trainable. Furthermore, we propose a novel re-weighted scheme to create an effective label correlation matrix to guide information propagation among the nodes in GCN. Experiments on two multi-label image recognition datasets show that our approach obviously outperforms other existing state-of-the-art methods. In addition, visualization analyses reveal that the classifiers learned by our model maintain meaningful semantic topology.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Multi-Label Classification | PASCAL VOC 2007 (test) | mAP94 | 125 | |
| Multi-Label Classification | NUS-WIDE (test) | mAP62.8 | 112 | |
| Multi-Label Classification | MS-COCO 2014 (test) | mAP83 | 81 | |
| Multi-label recognition | VG-200 | Avg OF142.5 | 66 | |
| Multi-label recognition | PASCAL VOC 2007 | Avg OF189.2 | 66 | |
| Multi-label recognition | MS-COCO | Overall F1 Score (OF1)76.4 | 66 | |
| Multi-label image recognition | VOC 2007 (test) | mAP94 | 61 | |
| Multi-Label Classification | VOC 07 | mAP88.9 | 61 | |
| Multi-label image recognition | MS-COCO 2014 (val) | mAP87.5 | 51 | |
| Multi-Label Classification | MS-COCO (val) | mAP83 | 47 |