Weakly Supervised Deep Detection Networks
About
Weakly supervised learning of object detection is an important problem in image understanding that still does not have a satisfactory solution. In this paper, we address this problem by exploiting the power of deep convolutional neural networks pre-trained on large-scale image-level classification tasks. We propose a weakly supervised deep detection architecture that modifies one such network to operate at the level of image regions, performing simultaneously region selection and classification. Trained as an image classifier, the architecture implicitly learns object detectors that are better than alternative weakly supervised detection systems on the PASCAL VOC data. The model, which is a simple and elegant end-to-end architecture, outperforms standard data augmentation and fine-tuning techniques for the task of image-level classification as well.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Object Detection | PASCAL VOC 2007 (test) | mAP39.3 | 821 | |
| Object Detection | COCO (val) | mAP19.6 | 613 | |
| Video Object Detection | ImageNet VID (val) | -- | 341 | |
| Object Detection | PASCAL VOC 2012 (test) | mAP31.4 | 270 | |
| Object Detection | MS-COCO 2017 (val) | -- | 237 | |
| Classification | PASCAL VOC 2007 (test) | mAP (%)89.7 | 217 | |
| Object Localization | PASCAL VOC 2007 (trainval) | CorLoc58 | 118 | |
| Object Detection | Watercolor2k (test) | mAP (Overall)12.7 | 113 | |
| Object Detection | MS-COCO (test) | AP11.5 | 81 | |
| Object Detection | Clipart1k (test) | mAP4.4 | 70 |