Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

EdgeStereo: A Context Integrated Residual Pyramid Network for Stereo Matching

About

Recent convolutional neural networks, especially end-to-end disparity estimation models, achieve remarkable performance on stereo matching task. However, existed methods, even with the complicated cascade structure, may fail in the regions of non-textures, boundaries and tiny details. Focus on these problems, we propose a multi-task network EdgeStereo that is composed of a backbone disparity network and an edge sub-network. Given a binocular image pair, our model enables end-to-end prediction of both disparity map and edge map. Basically, we design a context pyramid to encode multi-scale context information in disparity branch, followed by a compact residual pyramid for cascaded refinement. To further preserve subtle details, our EdgeStereo model integrates edge cues by feature embedding and edge-aware smoothness loss regularization. Comparative results demonstrates that stereo matching and edge detection can help each other in the unified model. Furthermore, our method achieves state-of-art performance on both KITTI Stereo and Scene Flow benchmarks, which proves the effectiveness of our design.

Xiao Song, Xu Zhao, Hanwen Hu, Liangji Fang• 2018

Related benchmarks

TaskDatasetResultRank
Stereo MatchingKITTI 2015 (test)
D1 Error (Overall)2.08
144
Stereo MatchingKITTI 2012 (test)
Outlier Rate (3px, Noc)1.46
76
Stereo MatchingScene Flow
EPE (px)1.12
40
Showing 3 of 3 rows

Other info

Follow for update