SegStereo: Exploiting Semantic Information for Disparity Estimation
About
Disparity estimation for binocular stereo images finds a wide range of applications. Traditional algorithms may fail on featureless regions, which could be handled by high-level clues such as semantic segments. In this paper, we suggest that appropriate incorporation of semantic cues can greatly rectify prediction in commonly-used disparity estimation frameworks. Our method conducts semantic feature embedding and regularizes semantic cues as the loss term to improve learning disparity. Our unified model SegStereo employs semantic features from segmentation and introduces semantic softmax loss, which helps improve the prediction accuracy of disparity maps. The semantic cues work well in both unsupervised and supervised manners. SegStereo achieves state-of-the-art results on KITTI Stereo benchmark and produces decent prediction on both CityScapes and FlyingThings3D datasets.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Stereo Matching | KITTI 2015 (test) | D1 Error (Overall)0.0744 | 144 | |
| Stereo Matching | KITTI 2015 | D1 Error (All)2.25 | 118 | |
| Disparity Estimation | KITTI 2015 (test) | D1 Error (bg, all)1.88 | 77 | |
| Stereo Matching | KITTI 2012 (test) | Outlier Rate (3px, Noc)1.68 | 76 | |
| Stereo Matching | KITTI Noc 2015 | D1 Error (Background)1.76 | 32 | |
| Stereo Matching | KITTI 2012 (Noc) | Error Rate (>2px)2.66 | 26 | |
| Stereo Matching | KITTI 2012 (All split) | Error Rate (>2px)3.19 | 26 | |
| Disparity Estimation | Scene Flow (test) | EPE1.45 | 24 | |
| Semantic segmentation | KITTI (test) | mIoU81.31 | 16 | |
| Stereo Disparity | KITTI 2015 | 3PE (Non Occlusion Foreground)3.7 | 12 |