
Multimodal Reasoning on MMKI2

80.61 Avg@8 Accuracy

PAPO_D

[Chart: y-axis ticks 46.394, 55.277, 64.16, 73.043; x-axis: Jul 8, 2025]
Updated 3d ago

Evaluation Results

Date      Avg@8 Accuracy
2025.07   80.61
2025.07   75.93
2025.07   72.52
2025.07   72.26
2025.07   66.83
2025.07   64.09
2025.07   57.39
2025.07   57.24
2025.07   48.57
2025.07   47.71