Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multimodal Understanding and Reasoning on MMMU-Pro
Loading...
44.7
Accuracy
PTA-GRPO
36.588
38.694
40.8
42.906
Oct 2, 2025
Accuracy
Updated 7d ago
Evaluation Results
Method
Method
Links
Accuracy
PTA-GRPO
Base Model=Qwen2.5-7B-VL
2025.10
44.7
SRPO
Base Model=Qwen2.5-7B-VL
2025.10
42.3
Base
Base Model=Qwen2.5-7B-VL
2025.10
36.9
Feedback
Search any
task
Search any
task