Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multimodal Reasoning on MMBench v1.1 (test)
Loading...
0.837
Overall Score
Vanilla
0.72884
0.75692
0.785
0.81308
Feb 4, 2026
Overall Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Overall Score
Vanilla
Token Budget=1296 Toke...
2026.02
0.837
PIO-FVLM
Token Budget=Retain 33...
2026.02
0.821
DART
Token Budget=Retain 33...
2026.02
0.809
PIO-FVLM
Token Budget=Retain 22...
2026.02
0.809
DART
Token Budget=Retain 22...
2026.02
0.789
PIO-FVLM
Token Budget=Retain 11...
2026.02
0.776
DART
Token Budget=Retain 11...
2026.02
0.733
Feedback
Search any
task
Search any
task