Deep Self-Taught Learning for Weakly Supervised Object Localization
About
Most existing weakly supervised localization (WSL) approaches learn detectors by finding positive bounding boxes based on features learned with image-level supervision. However, those features do not contain spatial location information and usually provide poor-quality positive samples for training a detector. To overcome this issue, we propose a deep self-taught learning approach, which makes the detector learn object-level features that are reliable for acquiring tight positive samples and then re-train itself on them. Consequently, the detector progressively improves its detection ability and localizes more informative positive samples. To implement such self-taught learning, we propose a seed sample acquisition method via image-to-object transferring and dense subgraph discovery to find reliable positive samples for initializing the detector. An online supportive sample harvesting scheme is further proposed to dynamically select the most confident tight positive samples and train the detector in a mutual boosting way. To prevent the detector from being trapped in poor optima due to overfitting, we propose using the relative improvement of predicted CNN scores to guide the self-taught learning process. Extensive experiments on PASCAL VOC 2007 and 2012 show that our approach outperforms state-of-the-art methods, strongly validating its effectiveness.
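The seed acquisition step can be illustrated with a minimal sketch. It assumes, as a simplification not confirmed by the abstract, that candidate boxes become graph nodes, that two nodes are connected when their IoU reaches a threshold, and that the seed is the node with the most neighbours. The names `iou`, `select_seed`, and `iou_threshold` are hypothetical and chosen only for this illustration.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def select_seed(boxes: List[Box], iou_threshold: float = 0.5) -> int:
    """Return the index of the box with the most overlapping neighbours.

    Assumption for illustration: an edge exists between two boxes when
    their IoU is at least `iou_threshold`, and the highest-degree node
    is treated as the densest region of the graph.
    """
    degrees = [
        sum(
            1
            for j, other in enumerate(boxes)
            if j != i and iou(box, other) >= iou_threshold
        )
        for i, box in enumerate(boxes)
    ]
    # Ties resolve to the lowest index.
    return max(range(len(boxes)), key=degrees.__getitem__)


candidates = [
    (50, 50, 60, 60),  # isolated box, no neighbours
    (0, 0, 10, 10),
    (1, 1, 11, 11),    # overlaps both of its neighbours
    (2, 2, 12, 12),
]
seed_index = select_seed(candidates)
```

In this toy input the third box overlaps two others while every other box overlaps at most one, so it is the one selected. A degree count is only one simple way to approximate a dense subgraph; the approach described above may use a different criterion.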
Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| Object Detection | PASCAL VOC 2007 (test) | mAP | 43.7 | 821 |
| Object Detection | PASCAL VOC 2012 (test) | mAP | 39.4 | 270 |
| Object Localization | PASCAL VOC 2007 (trainval) | CorLoc | 56.1 | 118 |
| Weakly Supervised Object Localization | PASCAL VOC 2007 (trainval) | CorLoc (Aero) | 73.9 | 54 |
| Object Localization | PASCAL VOC 2012 (trainval) | CorLoc | 58.8 | 51 |
| Object Detection | PASCAL VOC 2010 (test) | mAP | 38.3 | 31 |
| Object Detection | PASCAL VOC 2012 (val) | Mean AP^b | 39 | 27 |
| Weakly Supervised Object Localization | PASCAL VOC 2012 (trainval) | Aero CorLoc | 82.4 | 21 |
| Weakly Supervised Object Detection | VOC 2007 (trainval) | CorLoc | 56.1 | 12 |
| Object Detection | VOC 2012 (trainval) | CorLoc | 58.8 | 11 |
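
For readers unfamiliar with the CorLoc figures in the table, the following is a minimal sketch of how the metric is commonly computed: the fraction of images in which the predicted box overlaps a ground-truth box of the target class with IoU of at least 0.5. The function names and the toy data are assumptions for illustration only, not the evaluation code behind the numbers above.

```python
from typing import List, Sequence, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def corloc(
    predictions: Sequence[Box],
    ground_truths: Sequence[List[Box]],
    iou_threshold: float = 0.5,
) -> float:
    """Fraction of images whose single predicted box hits any ground-truth box.

    `predictions[i]` is the top-scoring box for image i, and
    `ground_truths[i]` lists that image's annotated boxes for the class.
    """
    if not predictions:
        return 0.0
    hits = sum(
        1
        for pred, gts in zip(predictions, ground_truths)
        if any(iou(pred, gt) >= iou_threshold for gt in gts)
    )
    return hits / len(predictions)


# Toy data: the first prediction overlaps its ground truth, the second misses.
preds = [(0, 0, 10, 10), (0, 0, 10, 10)]
gts = [[(1, 1, 11, 11)], [(20, 20, 30, 30)]]
score = corloc(preds, gts)  # one hit out of two images
```

The result is a fraction in [0, 1]; benchmark tables usually report it as a percentage, typically per class and then averaged over classes.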