Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multi-image Spatial Reasoning on SPAR-Bench-MV + MindCube-Tiny + MMSI-Bench (test)
Loading...
51
Overall Score
GPT-5.2
28.536
34.368
40.2
46.032
Feb 9, 2026
Overall Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Overall Score
GPT-5.2
Category=Proprietary
2026.02
51
Gemini-3-Pro
Category=Proprietary
2026.02
44.9
HATCH
Base Model=Qwen2.5-VL-3B
2026.02
43.6
GPT-4.1
Category=Proprietary
2026.02
40.7
Qwen2.5-VL-72B
Category=Open-Weight
2026.02
36.5
InternVL-2.5-4B
Category=Open-Weight
2026.02
35.5
SpatialLadder-3B
Base Model=Qwen2.5-VL-3B
2026.02
35.4
Video-R1
Base Model=Qwen2.5-VL-7B
2026.02
34.2
Qwen2.5-VL-32B
Category=Open-Weight
2026.02
33.8
LLaVA-OneVision-4B
Category=Open-Weight
2026.02
33.7
SpaceR-7B
Base Model=Qwen2.5-VL-7B
2026.02
31.9
Spatial-MLLM-4B
Base Model=Qwen2.5-VL-3B
2026.02
30.7
Qwen2.5-VL-7B
Base Model=Qwen2.5-VL-7B
2026.02
30.4
InternVL-2.5-8B
Category=Open-Weight
2026.02
30.3
Qwen2.5-VL-3B
Base Model=Qwen2.5-VL-3B
2026.02
29.4
Feedback
Search any
task
Search any
task