Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning to Discover Multi-Class Attentional Regions for Multi-Label Image Recognition

About

Multi-label image recognition is a practical and challenging task compared to single-label image classification. However, previous works may be suboptimal because of a great number of object proposals or complex attentional region generation modules. In this paper, we propose a simple but efficient two-stream framework to recognize multi-category objects from global image to local regions, similar to how human beings perceive objects. To bridge the gap between global and local streams, we propose a multi-class attentional region module which aims to make the number of attentional regions as small as possible and keep the diversity of these regions as high as possible. Our method can efficiently and effectively recognize multi-class objects with an affordable computation cost and a parameter-free region localization module. Over three benchmarks on multi-label image classification, we create new state-of-the-art results with a single model only using image semantics without label dependency. In addition, the effectiveness of the proposed method is extensively demonstrated under different factors such as global pooling strategy, input size and network architecture. Code has been made available at~\url{https://github.com/gaobb/MCAR}.

Bin-Bin Gao, Hong-Yu Zhou• 2020

Related benchmarks

TaskDatasetResultRank
Multi-Label ClassificationPASCAL VOC 2007 (test)
mAP94.8
125
Multi-Label ClassificationMS-COCO 2014 (test)
mAP83.8
81
Multi-label Image ClassificationVOC 2012 (test)
mAP94.3
72
Multi-label image recognitionVOC 2007 (test)
mAP92.9
61
Multi-label image recognitionMS-COCO 2014 (val)
mAP83.8
51
Multi-Label ClassificationMS-COCO (val)
mAP84.5
47
Multi-label Image ClassificationMS-COCO 2014 (test)
F1 Score (Top-3)75.3
24
Multi-Label ClassificationMS-COCO (test)
mAP83.8
24
Multi-Label ClassificationCOCO originally multi-label (test val)
mAP84.5
15
Multi-label Image ClassificationVOC 2012
AP (aero)99.6
10
Showing 10 of 10 rows

Other info

Code

Follow for update