Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multimodal Understanding on MMMU out-of-distribution
Loading...
56.55
Accuracy
CalibRL
54.5948
55.1024
55.61
56.1176
Feb 22, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
CalibRL
2026.02
56.55
RL-PLUS
2026.02
55.88
GRPO
2026.02
55.44
LUFFY
2026.02
55.22
SFT+GRPO
2026.02
54.67
DAPO
2026.02
54.67
Feedback
Search any
task
Search any
task