Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

M3CoT

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multi-modal ReasoningM3CoT
Accuracy82.68
90
Visual Question AnsweringM3CoT
Accuracy61.2
71
Multimodal ReasoningM3CoT (test)
Total Acc91.61
55
Multimodal Chain-of-Thought ReasoningM3CoT
Accuracy50.5
42
General Visual ReasoningM3CoT
Accuracy74.2
17
Multimodal Commonsense ReasoningM3CoT social-science and social-commonsense sub-topics
Accuracy Change11.99
12
General multimodal reasoningM3CoT
Pass@1 Accuracy78.21
11
Visual Evidence Quality EvaluationM3CoT Reasoning (subset of 500 samples)
AIM-CoT Win Rate76.4
2
Showing 8 of 8 rows