| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Text-guided 3D scene generation | 3D Scenes with Qwen1.5 captions (Scene chunks) | CLIP-Score 23.79 | 4 |
| Text-guided 3D scene generation | 3D Scenes with Qwen1.5 captions (Independent chunks) | CLIP-Score 23.96 | 4 |
| 3D Scene Generation | 3D Scenes average across all scene types | Avg FID 24.34 | 3 |
| Active View Planning | Large 3D Scenes (test) | Dunnottar Castle 73.9 | 3 |