Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multimodal Reasoning on MMMU (Efficiency and Performance Metrics)

57.1MMMU Score

DPA-Qwen3-32B

31.93238.4664551.534Mar 28, 2025Jun 4, 2025Aug 12, 2025Oct 20, 2025Dec 27, 2025Mar 6, 2026May 14, 2026
Updated 6d ago

Evaluation Results

MethodLinks
2026.05
57.1---
2026.05
56.9---
2025.11
51.5672.490.1921.09
2025.11
51.5670.572.1161.76
2025.11
51.44720.6839.15
2025.11
51.3372.6800
2025.11
51.1171.720.96-
2025.11
51.1171.940.7443.07
2025.11
5171.910.77-
2025.11
50.6772.180.5-
2025.11
50.5670.022.65-
2025.11
50.5670.322.3663.66
2026.05
50---
2026.05
49.9---
2025.11
49.3370.312.37-
2025.11
48.1166.616.07-
2026.05
43.1---
2026.05
41.1---
2026.05
40.8---
2026.05
37.8---
2025.03
36.87---
2026.05
35.1---
2026.05
34.9---
2025.03
34.62---
2025.03
34.23---
2025.03
33.16---
2026.05
32.9---