Multi-modal Reasoning on MMBench Overall & Relation Reasoning

84.7 Overall Accuracy

ChainMPQ

[Chart: Overall Accuracy over time. Y-axis ticks: 63.068, 68.684, 74.3, 79.916. X-axis: Oct 7, 2025]
Updated 1mo ago

Evaluation Results

Date     Overall  Δ    Relation Reasoning  Δ
2025.10  84.7     1.5  81.8                3.6
2025.10  84.2     0.6  83.9                1.4
2025.10  83.6     -    82.5                -
2025.10  83.2     -    78.2                -
2025.10  67.8     1.3  61.3                2.5
2025.10  66.5     -    58.8                -
2025.10  65.5     1.6  55.2                2.7
2025.10  63.9     -    52.5                -