| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| nuScenes VQA | LLaVA-NeXT-Interleave | Accuracy0.767 | 14 | 3d ago | |
| ScanQA | Chat-Scene | C Score87.7 | 10 | 3d ago | |
| 3DMV-VQA (held-out) | 3D-LLM | Concept Accuracy68.9 | 10 | 3d ago | |
| OMNI3D BENCH | VALOR | Accuracy44 | 7 | 3d ago | |
| SQA (test) | BridgeQA | EM@153.32 | 5 | 3d ago | |
| SQA3D | SQA3D-LLaMA | EM@148.09 | 3 | 3d ago | |
| 3DMV-VQA single room only subset of 1,212 scenes | Scene-LLM | Concept Accuracy70.2 | 2 | 3d ago | |
| SQA (val) | BridgeQA | EM@152.05 | 1 | 3d ago |