ICNet for Real-Time Semantic Segmentation on High-Resolution Images
About
We focus on the challenging task of real-time semantic segmentation in this paper. It finds many practical applications and yet is with fundamental difficulty of reducing a large portion of computation for pixel-wise label inference. We propose an image cascade network (ICNet) that incorporates multi-resolution branches under proper label guidance to address this challenge. We provide in-depth analysis of our framework and introduce the cascade feature fusion unit to quickly achieve high-quality segmentation. Our system yields real-time inference on a single GPU card with decent quality results evaluated on challenging datasets like Cityscapes, CamVid and COCO-Stuff.
Hengshuang Zhao, Xiaojuan Qi, Xiaoyong Shen, Jianping Shi, Jiaya Jia• 2017
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Semantic segmentation | Cityscapes (test) | mIoU70.6 | 1145 | |
| Semantic segmentation | Cityscapes (val) | mIoU67.7 | 572 | |
| Semantic segmentation | CamVid (test) | mIoU67.1 | 411 | |
| Semantic segmentation | Cityscapes (val) | mIoU69.5 | 287 | |
| Semantic segmentation | COCO Stuff | mIoU29.1 | 195 | |
| Semantic segmentation | Coco-Stuff (test) | mIoU29.1 | 184 | |
| Semantic segmentation | Mapillary (val) | mIoU42.8 | 153 | |
| Semantic segmentation | COCO Stuff (val) | mIoU29.1 | 126 | |
| Semantic segmentation | Cityscapes (val) | mIoU67.7 | 108 | |
| Semantic segmentation | Trans10K v2 (test) | mIoU23.39 | 104 |
Showing 10 of 34 rows