EdgeStereo: A Context Integrated Residual Pyramid Network for Stereo Matching

About

Recent convolutional neural networks, especially end-to-end disparity estimation models, achieve remarkable performance on stereo matching task. However, existed methods, even with the complicated cascade structure, may fail in the regions of non-textures, boundaries and tiny details. Focus on these problems, we propose a multi-task network EdgeStereo that is composed of a backbone disparity network and an edge sub-network. Given a binocular image pair, our model enables end-to-end prediction of both disparity map and edge map. Basically, we design a context pyramid to encode multi-scale context information in disparity branch, followed by a compact residual pyramid for cascaded refinement. To further preserve subtle details, our EdgeStereo model integrates edge cues by feature embedding and edge-aware smoothness loss regularization. Comparative results demonstrates that stereo matching and edge detection can help each other in the unified model. Furthermore, our method achieves state-of-art performance on both KITTI Stereo and Scene Flow benchmarks, which proves the effectiveness of our design.

Xiao Song, Xu Zhao, Hanwen Hu, Liangji Fang• 2018

Related benchmarks

Task	Dataset	Result
Stereo Matching	KITTI 2015 (test)	D1 Error (Overall)2.08	233
Stereo Matching	KITTI 2012 (test)	Outlier Rate (3px, Noc)1.46	105
Stereo Matching	Scene Flow	EPE (px)1.12	40

Showing 3 of 3 rows

Other info

Follow for update

@wizwand_team Discord