| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Semantic Segmentation | SUN RGB-D (test) | mIoU54.6 | 191 | |
| 3D Object Detection | SUN RGB-D (val) | mAP@0.2569.7 | 158 | |
| 3D Object Detection | SUN RGB-D | mAP@0.2567.9 | 104 | |
| Depth Estimation | SUN RGB-D (test) | Root Mean Square Error (RMS)0.275 | 93 | |
| 3D Object Detection | SUN RGB-D v1 (val) | mAP@0.2568.9 | 81 | |
| 3D Object Detection | SUN RGB-D (test) | mAP@0.2567.4 | 64 | |
| Semantic Segmentation | SUN RGB-D | mIoU53 | 45 | |
| Scene Recognition | SUN RGB-D Scene (test) | Acc (RGB-D)60.7 | 25 | |
| Multi-modal Recognition | SUN RGB-D | Accuracy0.5807 | 24 | |
| Monocular Depth Estimation | SUN RGB-D | Absolute Relative Error (Abs Rel)0.085 | 19 | |
| Indoor Object Detection | SUN RGB-D (test) | mAP@0.547.5 | 19 | |
| Depth Estimation | SUN RGB-D | Depth Error0.386 | 18 | |
| Depth Completion | SUN RGB-D (test) | RMSE0.214 | 18 | |
| 3D Object Detection | SUN RGB-D v1 (test) | Bed AP82.9 | 18 | |
| Monocular Depth Estimation | SUN RGB-D v1 (test) | Delta-1 Acc93.7 | 14 | |
| 3D Layout Estimation | SUN RGB-D | IoU64.4 | 14 | |
| 3D Spatial Grounding | SUN RGB-D | AP1548.3 | 10 | |
| Semantic Segmentation | SUN RGB-D (val) | Wall mIoU74.99 | 9 | |
| Camera Pose Estimation | SUN RGB-D | Pitch2.63 | 9 | |
| Object Detection | SUN RGB-D (test) | mAP19.1 | 8 | |
| 3D Layout Estimation | SUN RGB-D v1 (test) | Average IoU66.1 | 8 | |
| Depth Estimation | SUN RGB-D unseen indoor scenes | δ1 Accuracy86.6 | 7 | |
| 3D Object Recognition | SUN RGB-D (common classes split) | Top-1 Accuracy (Average)69.6 | 6 | |
| Scene Classification | SUN RGB-D | Accuracy64.9 | 6 | |
| 3D Object Detection | Traditional SUN RGB-D | AP@0.2568.8 | 6 |