Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Recurrent Attentional Reinforcement Learning for Multi-label Image Recognition

About

Recognizing multiple labels of images is a fundamental but challenging task in computer vision, and remarkable progress has been attained by localizing semantic-aware image regions and predicting their labels with deep convolutional neural networks. The step of hypothesis regions (region proposals) localization in these existing multi-label image recognition pipelines, however, usually takes redundant computation cost, e.g., generating hundreds of meaningless proposals with non-discriminative information and extracting their features, and the spatial contextual dependency modeling among the localized regions are often ignored or over-simplified. To resolve these issues, this paper proposes a recurrent attention reinforcement learning framework to iteratively discover a sequence of attentional and informative regions that are related to different semantic objects and further predict label scores conditioned on these regions. Besides, our method explicitly models long-term dependencies among these attentional regions that help to capture semantic label co-occurrence and thus facilitate multi-label recognition. Extensive experiments and comparisons on two large-scale benchmarks (i.e., PASCAL VOC and MS-COCO) show that our model achieves superior performance over existing state-of-the-art methods in both performance and efficiency as well as explicitly identifying image-level semantic labels to specific object regions.

Tianshui Chen, Zhouxia Wang, Guanbin Li, Liang Lin• 2017

Related benchmarks

TaskDatasetResultRank
Multi-Label ClassificationPASCAL VOC 2007 (test)
mAP92
125
Multi-label image recognitionVOC 2007 (test)
mAP92
61
Multi-label Image ClassificationPASCAL VOC 2007
mAP92
25
Multi-label Image ClassificationMS-COCO (val)
F1 (C)66.2
25
Multi-label Image ClassificationMS-COCO (test)--
24
Multi-label image recognitionMS-COCO (val)
CP78.8
23
Multi-label Image ClassificationPascal VOC
mAP92
7
Showing 7 of 7 rows

Other info

Follow for update