Multimodal Robustness on MMStar (test)
[Chart: MMStar Score by date; top score 69.8 (Qwen2.5-VL); date shown: Jan 15, 2026. Updated 4d ago.]
Evaluation Results
Method                 Params  Date     MMStar Score
Qwen2.5-VL             32B     2026.01  69.8
GPT-4o                 -       2026.01  63.9
LVR                    7B      2026.01  59.4
Qwen2.5-VL             7B      2026.01  58.9
Naive SFT              3B      2026.01  55.53
LaViT                  3B      2026.01  54.07
LVR_RL                 3B      2026.01  53.73
PAPO                   3B      2026.01  52.7
R1-OneVision           7B      2026.01  52.1
DMLR                   3B      2026.01  51.2
Qwen2.5-VL (Baseline)  3B      2026.01  50.2