Vision-Language Understanding on MMBench
[Chart: Accuracy, Feb 4, 2026 to Feb 13, 2026. Top result: 88.7 Accuracy (Qwen3-VL-4B-Instruct).]
Evaluation Results
| Method | Configuration | Date | Accuracy |
|---|---|---|---|
| Qwen3-VL-4B-Instruct | | 2026.02 | 88.7 |
| Xiaomi-Robotics-0 | pre-training=with VL data | 2026.02 | 84.4 |
| MolmoAct | | 2026.02 | 80.1 |
| Vanilla | Backbone=LLaVA-NeXT-7B... | 2026.02 | 67.4 |
| PIO-FVLM | Backbone=LLaVA-NeXT-7B... | 2026.02 | 66.2 |
| CDPruner | Backbone=LLaVA-NeXT-7B... | 2026.02 | 65.5 |
| DART | Backbone=LLaVA-NeXT-7B... | 2026.02 | 65.3 |
| HoloV | Backbone=LLaVA-NeXT-7B... | 2026.02 | 65.3 |
| VisionZip | Backbone=LLaVA-NeXT-7B... | 2026.02 | 63.1 |
| FastV | Backbone=LLaVA-NeXT-7B... | 2026.02 | 61.6 |
| SparseVLM | Backbone=LLaVA-NeXT-7B... | 2026.02 | 60.6 |
| pi_0.5 | | 2026.02 | 22.1 |
| pi_0 | | 2026.02 | 0 |
| Xiaomi-Robotics-0 | pre-training=without V... | 2026.02 | 0 |