Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

BiSeNet: Bilateral Segmentation Network for Real-time Semantic Segmentation

About

Semantic segmentation requires both rich spatial information and sizeable receptive field. However, modern approaches usually compromise spatial resolution to achieve real-time inference speed, which leads to poor performance. In this paper, we address this dilemma with a novel Bilateral Segmentation Network (BiSeNet). We first design a Spatial Path with a small stride to preserve the spatial information and generate high-resolution features. Meanwhile, a Context Path with a fast downsampling strategy is employed to obtain sufficient receptive field. On top of the two paths, we introduce a new Feature Fusion Module to combine features efficiently. The proposed architecture makes a right balance between the speed and segmentation performance on Cityscapes, CamVid, and COCO-Stuff datasets. Specifically, for a 2048x1024 input, we achieve 68.4% Mean IOU on the Cityscapes test dataset with speed of 105 FPS on one NVIDIA Titan XP card, which is significantly faster than the existing methods with comparable performance.

Changqian Yu, Jingbo Wang, Chao Peng, Changxin Gao, Gang Yu, Nong Sang• 2018

Related benchmarks

TaskDatasetResultRank
Semantic segmentationADE20K (val)
mIoU35.78
2731
Semantic segmentationCityscapes (test)
mIoU78.9
1145
Semantic segmentationADE20K
mIoU35.1
936
Semantic segmentationCityscapes
mIoU74.8
578
Semantic segmentationCityscapes (val)
mIoU80.3
572
Semantic segmentationCamVid (test)
mIoU68.7
411
Semantic segmentationSemanticKITTI (test)
mIoU37.1
335
Semantic segmentationCityscapes (val)
mIoU74.8
332
Semantic segmentationCityscapes (val)
mIoU74.8
287
Semantic segmentationCOCO Stuff
mIoU25.6
195
Showing 10 of 55 rows

Other info

Follow for update