ICNet for Real-Time Semantic Segmentation on High-Resolution Images

About

We focus on the challenging task of real-time semantic segmentation in this paper. It finds many practical applications and yet is with fundamental difficulty of reducing a large portion of computation for pixel-wise label inference. We propose an image cascade network (ICNet) that incorporates multi-resolution branches under proper label guidance to address this challenge. We provide in-depth analysis of our framework and introduce the cascade feature fusion unit to quickly achieve high-quality segmentation. Our system yields real-time inference on a single GPU card with decent quality results evaluated on challenging datasets like Cityscapes, CamVid and COCO-Stuff.

Hengshuang Zhao, Xiaojuan Qi, Xiaoyong Shen, Jianping Shi, Jiaya Jia• 2017

Related benchmarks

Task	Dataset	Result
Semantic segmentation	Cityscapes (test)	mIoU70.6	1145
Semantic segmentation	Cityscapes (val)	mIoU67.7	572
Semantic segmentation	CamVid (test)	mIoU67.1	411
Semantic segmentation	Cityscapes (val)	mIoU69.5	287
Semantic segmentation	COCO Stuff	mIoU29.1	195
Semantic segmentation	Coco-Stuff (test)	mIoU29.1	184
Semantic segmentation	Mapillary (val)	mIoU42.8	153
Semantic segmentation	COCO Stuff (val)	mIoU29.1	126
Semantic segmentation	Cityscapes (val)	mIoU67.7	108
Semantic segmentation	Trans10K v2 (test)	mIoU23.39	104

Showing 10 of 34 rows

Other info

Code

Follow for update

@wizwand_team Discord