2D Spatial Reasoning on GQA
Best Accuracy: 65.2 (Gemini-2.5-Flash), Dec 9, 2025
Updated 4d ago
Evaluation Results
Method              Model Category   Date      Accuracy
Gemini-2.5-Flash    Proprie...       2025.12   65.2
o4-mini             Proprie...       2025.12   65
VALOR               Open-So...       2025.12   64.4
Claude-3.5-Haiku    Proprie...       2025.12   61.3
GPT-4o              Proprie...       2025.12   58
VALOR-RL            Open-So...       2025.12   57.6
Qwen3-8B            Open-So...       2025.12   57.4
Gemini-2.0-Flash    Proprie...       2025.12   52.1
Gemma-3-12B         Open-So...       2025.12   46
Llama-3.2-11B       Open-So...       2025.12   39.9