| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Clustering | Scene-15 | Accuracy49.4 | 40 | |
| Unsupervised Feature Selection | Scene | NMI40.49 | 14 | |
| Clustering | Scene | Accuracy41.96 | 14 | |
| Description Quality | Scene 2 Suite main room | Color Accuracy79 | 8 | |
| Object Retrieval | Scene 2 Suite main room | SRobj78 | 8 | |
| Navigation | Scene 2 Suite main room | SRnav0.8 | 8 | |
| Relative Position | Scene 2 Suite main room | Accrel80 | 8 | |
| Description Quality | Scene 3 Laboratory storage 1.0 (overall) | Color Accuracy78 | 8 | |
| Instruction-based Navigation | Scene 3 Laboratory storage 1.0 (overall) | SRnav58 | 8 | |
| Relative Position Reasoning | Scene 3 Laboratory storage 1.0 (overall) | Accrel74 | 8 | |
| Description Quality | Scene Simple room 1 | Color Accuracy82 | 8 | |
| Object Retrieval | Scene Simple room 1 (overall) | SRobj83 | 8 | |
| Navigation | Scene Simple room 1 (overall) | Success Rate (SR)78 | 8 | |
| Relative Position Reasoning | Scene Simple room 1 (overall) | Accrel86 | 8 | |
| Multimodal Classification | Scene15 (test) | Accuracy77.9 | 8 | |
| 3D Scene Stylization | scene (train) | FID14.8 | 6 | |
| Multi-view Classification | Scene15 (test) | Average Test Accuracy65.74 | 6 | |
| Motion Planning | Scene OOD environment generated by MotionGeneralizer (test) | Success Rate38 | 5 | |
| Multilabel classification | scene (test) | Accuracy91.58 | 5 | |
| Normal Estimation | Scene200 | Accuracy (11.25° Threshold)59.4 | 4 | |
| 3D Pose Estimation | Scene 5 Corridor | MPJPE (mm)317.1 | 3 | |
| 3D Pose Estimation | Scene 4 Workstation | MPJPE222.4 | 3 |