Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Soft Proposal Networks for Weakly Supervised Object Localization

About

Weakly supervised object localization remains challenging, where only image labels instead of bounding boxes are available during training. Object proposal is an effective component in localization, but often computationally expensive and incapable of joint optimization with some of the remaining modules. In this paper, to the best of our knowledge, we for the first time integrate weakly supervised object proposal into convolutional neural networks (CNNs) in an end-to-end learning manner. We design a network component, Soft Proposal (SP), to be plugged into any standard convolutional architecture to introduce the nearly cost-free object proposal, orders of magnitude faster than state-of-the-art methods. In the SP-augmented CNNs, referred to as Soft Proposal Networks (SPNs), iteratively evolved object proposals are generated based on the deep feature maps then projected back, and further jointly optimized with network parameters, with image-level supervision only. Through the unified learning process, SPNs learn better object-centric filters, discover more discriminative visual evidence, and suppress background interference, significantly boosting both weakly supervised object localization and classification performance. We report the best results on popular benchmarks, including PASCAL VOC, MS COCO, and ImageNet.

Yi Zhu, Yanzhao Zhou, Qixiang Ye, Qiang Qiu, Jianbin Jiao• 2017

Related benchmarks

TaskDatasetResultRank
Instance SegmentationPASCAL VOC 2012 (val)
mAP @0.512.7
173
Object LocalizationPASCAL VOC 2007 (trainval)
CorLoc60.6
118
Pointing localizationVOC 2007 (test)--
44
Object DetectionMS-COCO 2014 (val)--
41
Pointing localizationVOC Difficult 2007 (test)
Accuracy79.1
21
Pointwise LocalizationPASCAL VOC 2012 (val)
mAP82.9
10
Pointwise LocalizationMS-COCO 2014 (val)
mAP55.3
6
Bounding box localizationILSVRC 2014 (val)
Localization Error36.3
5
Pointing localizationMSCOCO 2014 (test)
mAP55.3
5
Pointing-with-predictionCOCO 2014 (val)
mAP55.3
4
Showing 10 of 12 rows

Other info

Code

Follow for update