Visual Reasoning on ROBOSPATIAL
[Chart: Accuracy over time. Top result: VALOR, 69.5 (Dec 9, 2025)]
Updated 4d ago
Evaluation Results
Method            Model Category  Date     Accuracy
VALOR             Open-So...      2025.12  69.5
Gemini-2.5-Flash  Proprie...      2025.12  68.7
o4-mini           Proprie...      2025.12  61.8
VALOR-RL          Open-So...      2025.12  61.8
Qwen3-8B          Open-So...      2025.12  60.5
Llama-3.2-11B     Open-So...      2025.12  58.3
Gemini-2.0-Flash  Proprie...      2025.12  57
GPT-4o            Proprie...      2025.12  56.6
Claude-3.5-Haiku  Proprie...      2025.12  55.7
Gemma-3-12B       Open-So...      2025.12  54