| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Depth Estimation | NYU v2 (test) | Threshold Accuracy (delta < 1.25)98.9 | 432 | |
| Monocular Depth Estimation | NYU v2 (test) | Abs Rel0.046 | 300 | |
| Semantic Segmentation | NYU v2 (test) | mIoU59.13 | 282 | |
| Surface Normal Estimation | NYU v2 (test) | Mean Angle Distance (MAD)8.6 | 224 | |
| Depth Super-Resolution | NYU v2 (test) | RMSE1.49 | 136 | |
| Monocular Depth Estimation | NYU v2 | Delta 1 Acc98.8 | 131 | |
| Surface Normal Prediction | NYU v2 | Mean Error13.34 | 118 | |
| 3D Hand Pose Estimation | NYU (test) | Mean Error (mm)7.06 | 100 | |
| Semantic Scene Completion | NYU v2 (test) | Ceiling Error0 | 81 | |
| Joint Depth Super-Resolution and Denoising | NYU v2 (test) | RMSE5.14 | 78 | |
| Semantic Segmentation | NYU V2 | mIoU63.6 | 74 | |
| Affine-invariant depth estimation | NYU v2 | AbsRel0.042 | 59 | |
| Depth Estimation | NYU v2 | RMSE0.307 | 57 | |
| Depth Estimation | NYU v2 (val) | RMSE0.201 | 53 | |
| Scene Completion | NYU dataset (test) | mIoU75 | 50 | |
| Scene Completion | NYU v2 (test) | mIoU78.2 | 48 | |
| Semantic Scene Completion | NYU (test) | Ceiling Error0 | 46 | |
| Depth Super-Resolution | NYU v2 | RMSE0.0959 | 41 | |
| Depth Completion | NYU v2 (val) | RMSE0.09 | 41 | |
| Semantic Segmentation | NYU v2 (val) | mIoU60 | 37 | |
| Depth Super-Resolution / Completion | NYU v2 (test) | AbsRel1.53 | 36 | |
| Surface Normal Estimation | NYU v2 | Mean Angular Error-22.1 | 33 | |
| Metric Depth Estimation | NYU Metric Depth v2 (test) | Delta 1 Accuracy98.9 | 33 | |
| Depth Completion | NYU v2 | RMSE0.085 | 32 | |
| Multi-Task Learning | NYU v2 (test) | Delta m%403 | 31 |