Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multi-task Multimodal Understanding on MMT-Bench (val)
Loading...
72.7
Score
GPT-5
60.428
63.614
66.8
69.986
Apr 30, 2026
Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Score
GPT-5
Inference mode=Thinkin...
2026.04
72.7
Qwen3-Omni
Inference mode=Thinkin...
2026.04
70.9
Gemini 2.5 Flash
Inference mode=Thinkin...
2026.04
70.7
Qwen3-Omni
Size=30B-A3B, mode=ins...
2026.04
70.4
Gemini 2.5 Flash
Size=-, mode=instruct
2026.04
70
MiniCPM-o 4.5
Inference mode=Thinkin...
2026.04
69.7
MiniCPM-o 4.5
Size=9B, mode=instruct
2026.04
69.7
Qwen3-VL
Inference mode=Thinkin...
2026.04
68.1
InternVL3.5
Size=8B, mode=instruct
2026.04
66.7
Qwen3-VL
Size=8B, mode=instruct
2026.04
60.9
Feedback
Search any
task
Search any
task