| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| SQA3D | Video-3D LLM | EM@158.6 | 21 | 27d ago | |
| OMNI3D BENCH | CoVFT | Accuracy67.58 | 20 | 2mo ago | |
| ScanQA | Video-3D LLM | EM@130.1 | 17 | 27d ago | |
| Hypo3D | POMA-3D_llm | EM@135.9 | 14 | 27d ago | |
| nuScenes VQA | LLaVA-NeXT-Interleave | Accuracy0.767 | 14 | 3mo ago | |
| ScanQA | Chat-Scene | C Score87.7 | 10 | 3mo ago | |
| 3DMV-VQA (held-out) | 3D-LLM | Concept Accuracy68.9 | 10 | 3mo ago | |
| MM-Vet | SAGE-13B | GPT-4 Score54.89 | 9 | 2mo ago | |
| SQA (test) | BridgeQA | EM@153.32 | 5 | 3mo ago | |
| 3DMV-VQA single room only subset of 1,212 scenes | Scene-LLM | Concept Accuracy70.2 | 2 | 3mo ago | |
| SQA (val) | BridgeQA | EM@152.05 | 1 | 3mo ago |