Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Vision-Language Reasoning on MME (test)
Loading...
78.98
Simple Accuracy
No intervention baseline
72.584
74.2445
75.905
77.5655
Jan 18, 2026
Simple Accuracy
Paired Accuracy
Yes-Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Simple Accuracy
Paired Accuracy
Yes-Rate
No intervention baseline
Intervention=None
2026.01
78.98
30.77
42.29
Q4 system abl
Intervention=Q4 system...
2026.01
78.9
30
41.2
PAI
Intervention=PAI
2026.01
78.73
30.77
42.62
AD-HH
Intervention=AD-HH
2026.01
78.73
29.23
42.88
Image×2.0
Intervention=Image×2.0
2026.01
78.35
30.77
40.14
Q4 text redistr (prop)
Intervention=Q4 text r...
2026.01
78.26
29.23
41.07
Q4 system redistr (prop)
Intervention=Q4 system...
2026.01
72.83
21.54
28.98
Feedback
Search any
task
Search any
task