Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

M3CoT

Benchmarks

Task NameDataset NameSOTA ResultTrend
Visual Question AnsweringM3CoT
Accuracy61.2
56
Multimodal ReasoningM3CoT (test)
Total Acc91.61
47
Multimodal Chain-of-Thought ReasoningM3CoT
Accuracy50.5
42
General Visual ReasoningM3CoT
Accuracy74.2
17
Multi-modal ReasoningM3CoT
Accuracy66.94
12
General multimodal reasoningM3CoT
Pass@1 Accuracy78.21
11
Visual Evidence Quality EvaluationM3CoT Reasoning (subset of 500 samples)
AIM-CoT Win Rate76.4
2
Showing 7 of 7 rows