Single-Shot Object Detection with Enriched Semantics

About

We propose a novel single-shot object detection network named Detection with Enriched Semantics (DES). Our motivation is to enrich the semantics of object detection features within a typical deep detector through a semantic segmentation branch and a global activation module. The segmentation branch is supervised by weak segmentation ground truth, i.e., no extra annotation is required. In conjunction with that, we employ a global activation module that learns the relationship between channels and object classes in a self-supervised manner. Comprehensive experimental results on both the PASCAL VOC and MS COCO detection datasets demonstrate the effectiveness of the proposed method. In particular, with a VGG16-based DES, we achieve an mAP of 81.7 on VOC2007 test and an mAP of 32.8 on COCO test-dev, with an inference speed of 31.5 milliseconds per image on a Titan Xp GPU. With a lower-resolution version, we achieve an mAP of 79.7 on VOC2007 with an inference speed of 13.0 milliseconds per image.

Zhishuai Zhang, Siyuan Qiao, Cihang Xie, Wei Shen, Bo Wang, Alan L. Yuille • 2017
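
The inference times quoted in the abstract are per-image latencies, which can be easier to compare when expressed as throughput. The following is a minimal sketch of that arithmetic, assuming images are processed one at a time (the abstract does not state a batch size); the helper name `latency_ms_to_fps` is hypothetical and used only for illustration.

```python
def latency_ms_to_fps(latency_ms: float) -> float:
    """Convert a per-image latency in milliseconds to images per second.

    Assumes one image is processed at a time, so throughput is simply
    the reciprocal of the latency.
    """
    if latency_ms <= 0:
        raise ValueError("latency must be positive")
    return 1000.0 / latency_ms


# Latencies and mAP values as quoted in the abstract.
reported = {
    "DES, VGG16 (mAP 81.7 on VOC2007 test)": 31.5,
    "DES, lower resolution (mAP 79.7 on VOC2007)": 13.0,
}

for name, ms in reported.items():
    fps = latency_ms_to_fps(ms)
    print(f"{name}: {ms} ms/image is about {fps:.1f} images/s")
```

Under that assumption, 31.5 ms per image corresponds to roughly 31.7 images per second and 13.0 ms per image to roughly 76.9 images per second.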

Related benchmarks

Task	Dataset	Result
Object Detection	COCO (test-dev)	mAP 32.8	1239
Object Detection	PASCAL VOC 2007 (test)	mAP 81.7	844
Object Detection	PASCAL VOC 2012 (test)	mAP 80.3	293
Object Detection	VOC 2007 (test)	AP@50 81.7	91
Object Detection	VOC 2012 (test)	mAP@.50 80.3	69
