General Perception and Reasoning on MMStar
Chart: Score by date. Highlighted point: Claude-3.7-Sonnet, Score 68.8, Dec 26, 2025.
Updated 3d ago
Evaluation Results
| Method | Data | Date | Score |
|---|---|---|---|
| Claude-3.7-Sonnet | - | 2025.12 | 68.8 |
| InternVL3-8B | - | 2025.12 | 68.2 |
| BiPS-General-7B | 13K + 39K | 2025.12 | 65.7 |
| GPT-4o | - | 2025.12 | 65.1 |
| BiPS-Chart-7B | 13K | 2025.12 | 64.9 |
| Vision-R1-7B | 73K + 137K | 2025.12 | 64.8 |
| GRPO | 13K + 39K | 2025.12 | 64.6 |
| DeepEyes-7B | 14K + 33K | 2025.12 | 63 |
| Qwen2.5-VL-7B | - | 2025.12 | 62.1 |
| Chart-R1-7B | 258K | 2025.12 | 61.1 |
| R1-OneVision-7B | 67K + 98K | 2025.12 | 52.2 |