Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multi-modal Question Answering on MMStar
Loading...
81
Accuracy
Qwen3.5-27B
52.4
59.825
67.25
74.675
Oct 27, 2025
Nov 23, 2025
Dec 20, 2025
Jan 17, 2026
Feb 13, 2026
Mar 12, 2026
Apr 9, 2026
Accuracy
Updated 6d ago
Evaluation Results
Method
Method
Links
Accuracy
Qwen3.5-27B
Mode=REASONING, Archit...
2026.04
81
Qwen3-VL-32B
Mode=Thinking, Archite...
2026.04
79.4
Qwen3-VL-235B-A22B
Mode=Thinking, Archite...
2026.04
78.7
EXAONE 4.5 33B
Mode=REASONING, Archit...
2026.04
74.9
GPT-5 mini
Mode=REASONING: HIGH,...
2026.04
74.1
MergeMix
baseline=SFT Vision
2025.10
62.92
HART-7B
Resolution=511 × 391
2026.02
62.8
SFT Vision
2025.10
62.66
Qwen2.5-VL-Ins-7B
2025.10
62.42
VisionThink-7B
2025.10
61
Qwen2.5-VL-7B
Resolution=511 × 391
2026.02
59.3
LLaVA-OneVision-7B
Resolution=511 × 391
2026.02
56.7
InternVL3-7B
Resolution=511 × 391
2026.02
53.5
Feedback
Search any
task
Search any
task