Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multimodal Commonsense Reasoning on M3CoT social-science and social-commonsense sub-topics

11.99Accuracy Change

GPT-4o

-17.2028-9.6239-2.0455.5339Jul 27, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.07
11.99-
2025.07
5.9-
2025.07
1.89-
2025.07
0.92-
2025.07
0.06-
2025.07
-0.59-
2025.07
-2.28-
2025.07
-7.23-
2025.07
-9.78-
2025.07
-11.3-
2025.07
-15.15-
2025.07
-16.08-
2025.07
-57.23
2025.07
-65.1
2025.07
-81.4
2025.07
-81.76