Multimodal Reasoning on General Multimodal Reasoning Aggregate

Average Performance: 65.83

PAPO_D

[Chart: Average Performance by date; y-axis ticks 48.462, 52.971, 57.48, 61.989; x-axis label Jul 8, 2025]
Updated 3d ago

Evaluation Results

Date      Average Performance   (unlabeled)   Method   Links
2025.07   65.83                 15.61
2025.07   63.5                  1.53
2025.07   62.51                 -
2025.07   57.58                 -
2025.07   57.09                 5
2025.07   55.02                 -
2025.07   53.39                 3.38
2025.07   51.89                 -
2025.07   51.36                 4.52
2025.07   49.13                 -