ParseNet: Looking Wider to See Better

About

We present a technique for adding global context to deep convolutional networks for semantic segmentation. The approach is simple, using the average feature for a layer to augment the features at each location. In addition, we study several idiosyncrasies of training, significantly increasing the performance of baseline networks (e.g. from FCN). When we add our proposed global feature, and a technique for learning normalization parameters, accuracy increases consistently even over our improved versions of the baselines. Our proposed approach, ParseNet, achieves state-of-the-art performance on SiftFlow and PASCAL-Context with small additional computational cost over baselines, and near current state-of-the-art performance on PASCAL VOC 2012 semantic segmentation with a simple approach. Code is available at https://github.com/weiliu89/caffe/tree/fcn .

Wei Liu, Andrew Rabinovich, Alexander C. Berg• 2015

Related benchmarks

Task	Dataset	Result
Semantic segmentation	PASCAL VOC 2012 (test)	mIoU69.8	1477
Semantic segmentation	PASCAL Context (val)	mIoU40.4	360
Semantic segmentation	ScanNet (val)	mIoU47.72	302
Semantic segmentation	Pascal VOC (test)	mIoU69.8	268
Semantic segmentation	Pascal Context	mIoU40.4	217
3D Semantic Segmentation	ScanNet V2 (val)	mIoU47.72	209
Semantic segmentation	Pascal Context 60	mIoU40.4	139
Semantic segmentation	PASCAL-Context 59 classes (test)	mIoU40.4	75
Semantic segmentation	SYNTHIA (val)	mIoU71.02	71
Semantic segmentation	PASCAL-Context 60 classes (test)	mIoU40.4	54

Showing 10 of 13 rows

Other info

Code

Follow for update

@wizwand_team Discord