Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multimodal Multi-choice on MMStar (Accuracy)
Loading...
65.1
Accuracy
Claude3.7-Sonnet
45.756
50.778
55.8
60.822
Dec 18, 2025
Accuracy
Updated 3d ago
Evaluation Results
Method
Method
Links
Accuracy
Claude3.7-Sonnet
2025.12
65.1
SkiLa
2025.12
64.8
SkiLa-V
2025.12
64.5
LLaVA-OneVision 7B
Model Size=7B
2025.12
61.9
GPT-4o
2025.12
61.6
Direct SFT
2025.12
61.4
LVR 7B*
Model Size=7B, Tested...
2025.12
61.3
Vision-R1 7B*
Model Size=7B, Tested...
2025.12
60.7
Qwen2.5-VL 7B
Model Size=7B
2025.12
60.3
Gemma3 27B
Model Size=27B
2025.12
59.6
GPT-4v
2025.12
56
GPT-4o-mini
2025.12
54.8
ROSS 7B
Model Size=7B
2025.12
53.9
Cambrian 13B
Model Size=13B
2025.12
47.1
Janus-Pro 7B
Model Size=7B
2025.12
46.5
Feedback
Search any
task
Search any
task