Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multimodal Understanding on MMMU (dev)
Loading...
25.33
Accuracy
Defender
19.7868
21.2259
22.665
24.1041
Jan 24, 2026
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Defender
Iteration=3
2026.01
25.33
Defender
Iteration=2
2026.01
23.33
Base (M_def^(0)) + Clean Data
Data=Cleaned
2026.01
21.33
Base (M_def^(0))
2026.01
20.67
Defender
Iteration=1
2026.01
20
Feedback
Search any
task
Search any
task