| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Semantic Segmentation | SUN RGB-D (test) | mIoU54.6 | 212 | |
| 3D Object Detection | SUN RGB-D (val) | mAP@0.2569.7 | 163 | |
| 3D Object Detection | SUN RGB-D | mAP@0.2567.9 | 104 | |
| Depth Estimation | SUN RGB-D (test) | Root Mean Square Error (RMS)0.275 | 93 | |
| 3D Object Detection | SUN RGB-D v1 (val) | mAP@0.2568.9 | 81 | |
| Semantic Segmentation | SUN RGB-D | mIoU53 | 65 | |
| 3D Object Detection | SUN RGB-D (test) | mAP@0.2567.4 | 64 | |
| 3D Object Detection | SUN RGB-D | Base AP@0.2568.16 | 40 | |
| Depth Estimation | SUN RGB-D | Depth Error0.386 | 34 | |
| Object Detection | SUN RGB-D (test) | mAP55.7 | 25 | |
| Scene Recognition | SUN RGB-D Scene (test) | Acc (RGB-D)60.7 | 25 | |
| Multi-modal Recognition | SUN RGB-D | Accuracy0.5807 | 24 | |
| Monocular Depth Estimation | SUN RGB-D | Absolute Relative Error (Abs Rel)0.085 | 19 | |
| Indoor Object Detection | SUN RGB-D (test) | mAP@0.547.5 | 19 | |
| Depth Completion | SUN RGB-D (test) | RMSE0.214 | 18 | |
| 3D Object Detection | SUN RGB-D v1 (test) | Bed AP82.9 | 18 | |
| Monocular Depth Estimation | SUN RGB-D v1 (test) | Delta-1 Acc93.7 | 14 | |
| 3D Layout Estimation | SUN RGB-D | IoU64.4 | 14 | |
| Object Detection | SUN RGB-D | mAP@0.536.6 | 13 | |
| Object Detection | SUN RGB-D | GFLOPS476.5 | 12 | |
| 3D Spatial Grounding | SUN RGB-D | AP1548.3 | 10 | |
| Metric Depth Estimation | SUN RGB-D | AbsRel0.451 | 10 | |
| Indoor Scene Recognition | SUN RGB-D | Mean-class Accuracy57.7 | 9 | |
| Semantic Segmentation | SUN RGB-D (val) | Wall mIoU74.99 | 9 | |
| Camera Pose Estimation | SUN RGB-D | Pitch2.63 | 9 |