Weakly Supervised Localization using Deep Feature Maps
About
Object localization is an important computer vision problem with a variety of applications. The lack of large scale object-level annotations and the relative abundance of image-level labels makes a compelling case for weak supervision in the object localization task. Deep Convolutional Neural Networks are a class of state-of-the-art methods for the related problem of object recognition. In this paper, we describe a novel object localization algorithm which uses classification networks trained on only image labels. This weakly supervised method leverages local spatial and semantic patterns captured in the convolutional layers of classification networks. We propose an efficient beam search based approach to detect and localize multiple objects in images. The proposed method significantly outperforms the state-of-the-art in standard object localization data-sets with a 8 point increase in mAP scores.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Object Detection | PASCAL VOC 2007 (test) | mAP25.7 | 821 | |
| Object Detection | MS-COCO 2014 (val) | -- | 41 | |
| Object Detection | PASCAL VOC 2012 (val) | Mean AP^b79.7 | 27 | |
| Pointwise Localization | PASCAL VOC 2012 (val) | mAP79.7 | 10 | |
| Image Classification | PASCAL VOC 2012 (val) | mAP86.5 | 8 | |
| Pointwise Localization | MS-COCO 2014 (val) | mAP49.2 | 6 | |
| Pointing localization | MSCOCO 2014 (test) | mAP49.2 | 5 | |
| Pointing-with-prediction | COCO 2014 (val) | mAP49.2 | 4 | |
| Weakly Supervised Object Localization | MS-COCO (val) | mAP49.2 | 2 |