Instruction Following and Open-ended question answering on MMAU-Pro
[Chart: AIF Score (highlighted point: Human, 100; May 26, 2026). Available metrics: AIF Score, Closed Score, Open Score, Overall Score.]
Updated 6d ago
Evaluation Results
| Method | Details | Date | AIF Score | Closed Score | Open Score | Overall Score |
|---|---|---|---|---|---|---|
| Human | | 2026.05 | 100 | 77.56 | 77.3 | 77.9 |
| MAPO | | 2026.05 | 95.4 | 62 | 85.32 | 65.29 |
| Gemini 2.5 Flash | | 2026.05 | 95.1 | 57.39 | 67.5 | 59.2 |
| Qwen3-Omni-Instruct | Training mode=Instruct... | 2026.05 | 89.66 | 57.84 | 87.38 | 61.84 |
| GPT-4o Audio | | 2026.05 | 82.5 | 53.2 | 43.2 | 52.5 |
| Qwen3-Omni-Thinking | Training mode=Thinking... | 2026.05 | 62.07 | 59.63 | 84.77 | 62.63 |
| Qwen2.5-Omni | Model size=7B | 2026.05 | 61.3 | 52.01 | 52.3 | 52.2 |