
Multimodal Understanding on MMMU (dev)

Accuracy: 67.3

PVM-8B (SFT + GRPO)

[Chart: accuracy over time, Jan 24, 2026 to May 1, 2026]
Updated 1mo ago

Evaluation Results

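| Date | Accuracy | Method | Links |
|---|---|---|---|
| 2026.05 | 67.3 | | |
| 2026.05 | 66.7 | | |
| 2026.05 | 65.3 | | |
| 2026.05 | 64.7 | | |
| 2026.05 | 63.3 | | |
| 2026.05 | 63.3 | | |
| 2026.05 | 62.7 | | |
| 2026.05 | 62 | | |
| 2026.05 | 62 | | |
| 2026.05 | 62 | | |
| 2026.05 | 60.7 | | |
| 2026.05 | 60.7 | | |
| 2026.05 | 60.7 | | |
| 2026.05 | 59.3 | | |
| 2026.05 | 58 | | |
| 2026.05 | 58 | | |
| 2026.05 | 57.3 | | |
| 2026.05 | 57.3 | | |
| 2026.05 | 56 | | |
| 2026.05 | 56 | | |
| 2026.01 | 25.33 | | |
| 2026.01 | 23.33 | | |
| | 21.33 | | |
| | 20.67 | | |
| 2026.01 | 20 | | |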