SegStereo: Exploiting Semantic Information for Disparity Estimation

About

Disparity estimation for binocular stereo images finds a wide range of applications. Traditional algorithms may fail on featureless regions, which could be handled by high-level clues such as semantic segments. In this paper, we suggest that appropriate incorporation of semantic cues can greatly rectify prediction in commonly-used disparity estimation frameworks. Our method conducts semantic feature embedding and regularizes semantic cues as the loss term to improve learning disparity. Our unified model SegStereo employs semantic features from segmentation and introduces semantic softmax loss, which helps improve the prediction accuracy of disparity maps. The semantic cues work well in both unsupervised and supervised manners. SegStereo achieves state-of-the-art results on KITTI Stereo benchmark and produces decent prediction on both CityScapes and FlyingThings3D datasets.

Guorun Yang, Hengshuang Zhao, Jianping Shi, Zhidong Deng, Jiaya Jia• 2018

Related benchmarks

Task	Dataset	Result
Stereo Matching	KITTI 2015 (test)	D1 Error (Overall)0.0744	245
Stereo Matching	KITTI 2015	D1 Error (All)2.25	142
Stereo Matching	KITTI 2012 (test)	Outlier Rate (3px, Noc)1.68	105
Disparity Estimation	KITTI 2015 (test)	D1 Error (bg, all)1.88	77
Stereo Matching	KITTI Noc 2015	D1 Error (Background)1.76	42
Stereo Matching	KITTI 2012 (Noc)	Error Rate (>2px)2.66	26
Stereo Matching	KITTI 2012 (All split)	Error Rate (>2px)3.19	26
Disparity Estimation	Scene Flow (test)	EPE1.45	24
Semantic segmentation	KITTI (test)	mIoU81.31	16
Stereo Disparity	KITTI 2015	3PE (Non Occlusion Foreground)3.7	12

Showing 10 of 13 rows

Other info

Follow for update

@wizwand_team Discord