Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Simultaneous Detection and Segmentation

About

We aim to detect all instances of a category in an image and, for each instance, mark the pixels that belong to it. We call this task Simultaneous Detection and Segmentation (SDS). Unlike classical bounding box detection, SDS requires a segmentation and not just a box. Unlike classical semantic segmentation, we require individual object instances. We build on recent work that uses convolutional neural networks to classify category-independent region proposals (R-CNN [16]), introducing a novel architecture tailored for SDS. We then use category-specific, top- down figure-ground predictions to refine our bottom-up proposals. We show a 7 point boost (16% relative) over our baselines on SDS, a 5 point boost (10% relative) over state-of-the-art on semantic segmentation, and state-of-the-art performance in object detection. Finally, we provide diagnostic tools that unpack performance and provide directions for future work.

Bharath Hariharan, Pablo Arbel\'aez, Ross Girshick, Jitendra Malik• 2014

Related benchmarks

TaskDatasetResultRank
Semantic segmentationPASCAL VOC 2012 (test)
mIoU51.6
1342
Object DetectionPASCAL VOC 2012 (test)
mAP50.7
270
Instance SegmentationPASCAL VOC 2012 (val)
mAP @0.549.7
173
Instance SegmentationCityscapes (test)
AP (Overall)4.6
122
Object DetectionPASCAL VOC 2012 (val)
Mean AP^b53.9
27
Instance SegmentationVOC 2012 (val)
AP^r @ IoU=0.543.8
13
Simultaneous Detection and SegmentationVOC 2012 (val)
AP^r (aeroplane)68.4
10
Semantic segmentationVOC 2011 (test)
mIoU52.6
9
Semantic segmentationPASCAL VOC 2011 (test)
mIoU52.6
9
Showing 9 of 9 rows

Other info

Code

Follow for update