Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Visual Reasoning on ScienceQA
Loading...
87.2
Accuracy
DualMindVLM
82
83.35
84.7
86.05
Nov 20, 2025
Accuracy
Sample Length
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Sample Length
DualMindVLM
Size=7B, Strategy=RL
2025.11
87.2
98
VL-Rethinker
Size=7B, Strategy=RL
2025.11
85.5
205
Qwen2.5-VL
Size=7B, Strategy=-
2025.11
84
156
MM-Eureka
Size=7B, Strategy=RL
2025.11
83.5
202
OpenVLThinker
Size=7B, Strategy=SFT+RL
2025.11
82.2
171
Feedback
Search any
task
Search any
task