Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multi-discipline Multimodal Reasoning on MMMU

61.3Accuracy

SophiaVL-R1-7B

31.55639.2784754.722May 22, 2025Jul 11, 2025Aug 31, 2025Oct 21, 2025Dec 10, 2025Jan 30, 2026Mar 22, 2026
Updated 25d ago

Evaluation Results

MethodLinks
2025.05
61.3
2025.05
59.1
2025.05
58.7
2025.05
58
2025.05
56.8
2025.05
56.8
2025.05
56.2
2025.05
51.6
2025.05
49.7
2025.05
48.8
2025.05
43.1
2026.03
36.1
2026.03
36.1
2026.03
35.6
2026.03
35.4
2026.03
35.1
2026.03
35
2026.03
34.9
2026.03
34.8
2026.03
34.8
2026.03
34.6
2026.03
34.6
2026.03
34.6
2026.03
34.6
2026.03
34.4
2026.03
34.4
2026.03
34.3
2026.03
34.2
2026.03
34.1
2026.03
33.9
2026.03
33.8
2026.03
33.4
2026.03
32.7