ParseNet: Looking Wider to See Better
About
We present a technique for adding global context to deep convolutional networks for semantic segmentation. The approach is simple, using the average feature for a layer to augment the features at each location. In addition, we study several idiosyncrasies of training, significantly increasing the performance of baseline networks (e.g. from FCN). When we add our proposed global feature, and a technique for learning normalization parameters, accuracy increases consistently even over our improved versions of the baselines. Our proposed approach, ParseNet, achieves state-of-the-art performance on SiftFlow and PASCAL-Context with small additional computational cost over baselines, and near current state-of-the-art performance on PASCAL VOC 2012 semantic segmentation with a simple approach. Code is available at https://github.com/weiliu89/caffe/tree/fcn .
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Semantic segmentation | PASCAL VOC 2012 (test) | mIoU69.8 | 1342 | |
| Semantic segmentation | PASCAL Context (val) | mIoU40.4 | 323 | |
| Semantic segmentation | Pascal VOC (test) | mIoU69.8 | 236 | |
| Semantic segmentation | ScanNet (val) | mIoU47.72 | 231 | |
| 3D Semantic Segmentation | ScanNet V2 (val) | mIoU47.72 | 171 | |
| Semantic segmentation | Pascal Context | mIoU40.4 | 111 | |
| Semantic segmentation | Pascal Context 60 | mIoU40.4 | 81 | |
| Semantic segmentation | PASCAL-Context 59 classes (test) | mIoU40.4 | 75 | |
| Semantic segmentation | SYNTHIA (val) | mIoU71.02 | 71 | |
| Semantic segmentation | PASCAL-Context 60 classes (test) | mIoU40.4 | 54 |