Multi-modal Instruction Following on MM MTBench
[Chart: Overall Score. Highlighted point: Ministral 3, 84.9, Jan 13, 2026.]
Updated 4d ago
Evaluation Results
Method        Configuration               Date      Overall Score
Ministral 3   Model Size=14B              2026.01   84.9
Ministral 3   Model Size=8B               2026.01   80.8
Qwen3-VL      Model Size=4B, Variant...   2026.01   80.08
Qwen3-VL      Model Size=8B, Variant...   2026.01   80
Ministral 3   Model Size=3B               2026.01   78.3
Gemma3        Model Size=12B, Varian...   2026.01   67
Qwen3-VL      Model Size=2B, Variant...   2026.01   63.6
Gemma3        Model Size=4B, Variant...   2026.01   52.3