Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation

About

Recent leading approaches to semantic segmentation rely on deep convolutional networks trained with human-annotated, pixel-level segmentation masks. Such pixel-accurate supervision demands expensive labeling effort and limits the performance of deep networks that usually benefit from more training data. In this paper, we propose a method that achieves competitive accuracy but only requires easily obtained bounding box annotations. The basic idea is to iterate between automatically generating region proposals and training convolutional networks. These two steps gradually recover segmentation masks for improving the networks, and vise versa. Our method, called BoxSup, produces competitive results supervised by boxes only, on par with strong baselines fully supervised by masks under the same setting. By leveraging a large amount of bounding boxes, BoxSup further unleashes the power of deep convolutional networks and yields state-of-the-art results on PASCAL VOC 2012 and PASCAL-CONTEXT.

Jifeng Dai, Kaiming He, Jian Sun• 2015

Related benchmarks

TaskDatasetResultRank
Semantic segmentationPASCAL VOC 2012 (val)
Mean IoU63.8
2040
Semantic segmentationPASCAL VOC 2012 (test)
mIoU75.2
1342
Semantic segmentationPASCAL VOC (val)
mIoU63.8
338
Semantic segmentationPASCAL Context (val)
mIoU40.5
323
Semantic segmentationPascal VOC (test)
mIoU75.2
236
Semantic segmentationPascal Context (test)
mIoU40.5
176
Weakly supervised semantic segmentationPASCAL VOC 2012 (test)
mIoU64.6
158
Semantic segmentationPASCAL-Context 59 class (val)
mIoU40.5
125
Semantic segmentationPascal Context 60
mIoU40.5
81
Semantic segmentationPASCAL-Context 59 classes (test)
mIoU40.5
75
Showing 10 of 14 rows

Other info

Follow for update