Visual Tool-Use on V* Bench
Best result: Mini-o3, 88.2 Accuracy
[Chart: Accuracy over time; date shown: Dec 4, 2025]
Evaluation Results
Method           Details                     Date     Accuracy
Mini-o3          Source=Lai et al. (2025)    2025.12  88.2
ARM-Thinker-7B   Size=7B, Backbone=Qwen...   2025.12  86.4
Pixel Reasoner   Source=Lai et al. (2025)    2025.12  86.3
DeepEyes         Source=Lai et al. (2025)    2025.12  83.3
Qwen3-VL-8B      Size=8B                     2025.12  82.2
Qwen2.5-VL-7B    Size=7B                     2025.12  75.4
InternVL3-8B     Size=8B                     2025.12  69.6
InternVL3.5-8B   Size=8B                     2025.12  69.1
GPT-4o           Source=Lai et al. (2025)    2025.12  65.2