Pattern-Affinitive Propagation across Depth, Surface Normal and Semantic Segmentation
About
In this paper, we propose a novel Pattern-Affinitive Propagation (PAP) framework to jointly predict depth, surface normal and semantic segmentation. The motivation behind it comes from the statistic observation that pattern-affinitive pairs recur much frequently across different tasks as well as within a task. Thus, we can conduct two types of propagations, cross-task propagation and task-specific propagation, to adaptively diffuse those similar patterns. The former integrates cross-task affinity patterns to adapt to each task therein through the calculation on non-local relationships. Next the latter performs an iterative diffusion in the feature space so that the cross-task affinity patterns can be widely-spread within the task. Accordingly, the learning of each task can be regularized and boosted by the complementary task-level affinities. Extensive experiments demonstrate the effectiveness and the superiority of our method on the joint three tasks. Meanwhile, we achieve the state-of-the-art or competitive results on the three related datasets, NYUD-v2, SUN-RGBD and KITTI.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Depth Estimation | NYU v2 (test) | Threshold Accuracy (delta < 1.25)84.6 | 423 | |
| Semantic segmentation | NYU v2 (test) | mIoU50.4 | 248 | |
| Surface Normal Estimation | NYU v2 (test) | Mean Angle Distance (MAD)18.6 | 206 | |
| Semantic segmentation | SUN RGB-D (test) | mIoU50.5 | 191 | |
| Semantic segmentation | NYUD v2 (test) | mIoU50.4 | 187 | |
| Depth Estimation | NYU Depth V2 | RMSE0.53 | 177 | |
| Semantic segmentation | NYU Depth V2 (test) | mIoU50.4 | 172 | |
| Monocular Depth Estimation | KITTI (test) | Abs Rel Error10.27 | 103 | |
| Surface Normal Prediction | NYU V2 | Mean Error18.6 | 100 | |
| Semantic segmentation | NYUDv2 40-class (test) | mIoU50.4 | 99 |