Video Multimodal Understanding on VideoMMMU
Best accuracy: 61.2 (GPT-4o)
[Figure: Accuracy over time, with a data point at Jan 9, 2026]
Evaluation Results
| Method              | Category         | Date    | Accuracy |
|---------------------|------------------|---------|----------|
| GPT-4o              | Closed-source    | 2026.01 | 61.2     |
| Gemini-1.5-Pro      | Closed-source    | 2026.01 | 60.6     |
| Qwen-VL-2.5-7B-Ours | Our Models       | 2026.01 | 50       |
| MiniCPM-V2.6-8B     | Open-source B... | 2026.01 | 49.8     |
| Video-R1-7B         | Open-source R... | 2026.01 | 48.1     |
| Qwen-VL-2.5-7B-GRPO | Our Models       | 2026.01 | 47.3     |
| Qwen-VL-2.5-7B-SFT  | Our Models       | 2026.01 | 46       |
| InternVL2.5-8B      | Open-source B... | 2026.01 | 44.2     |
| R1-OneVision-7B     | Open-source R... | 2026.01 | 44.1     |
| Qwen-VL-2.5-7B      | Our Models       | 2026.01 | 43.9     |
| R1-VL-7B            | Open-source R... | 2026.01 | 42.9     |
| Vision-R1-7B        | Open-source R... | 2026.01 | 39.7     |
| LLaVA-OneVision-7B  | Open-source V... | 2026.01 | 31.2     |
| VILA-1.5-8B         | Open-source V... | 2026.01 | 20.8     |