Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multi-image Understanding on MuirBench Multi-image Understanding
Loading...
62.3
Accuracy
GPT-4V
-2.05728
14.65086
31.359
48.06714
Dec 16, 2025
Dec 17, 2025
Dec 19, 2025
Dec 21, 2025
Dec 23, 2025
Dec 25, 2025
Dec 27, 2025
Accuracy
Updated 3d ago
Evaluation Results
Method
Method
Links
Accuracy
GPT-4V
Proprietary=true
2025.12
62.3
Qwen2.5-VL-7B
Parameters=7B
2025.12
58.2
InternVL-2.5-8B
Parameters=8B
2025.12
51.2
SDAR-VL-8B-Inst
Parameters=8B, Trainin...
2025.12
50.2
InternVL-2-8B
Parameters=8B
2025.12
48.5
LLaDA-V-8B
Parameters=8B, Control...
2025.12
48.3
Qwen2.5-VL-3B
Parameters=3B
2025.12
46.5
InternVL-2.5-4B
Parameters=4B
2025.12
45.1
SDAR-VL-4B-Inst
Parameters=4B, Trainin...
2025.12
44.8
LLaVA-OV-7B
Parameters=7B
2025.12
40.5
InternVL-2-4B
Parameters=4B
2025.12
40.3
Qwen2-VL-7B
Parameters=7B
2025.12
39.9
GPT-4o
LLM Backbone=N/A, Open...
2025.12
0.68
MAmmoTH-VL
LLM Backbone=Qwen2.5-7...
2025.12
0.551
Dream-VL
LLM Backbone=Dream 7B,...
2025.12
0.512
LLaDA-V
LLM Backbone=LLaDA 8B,...
2025.12
0.483
LLaVA-OV
LLM Backbone=Qwen2-7B,...
2025.12
0.418
Feedback
Search any
task
Search any
task