Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Scientific Multimodal Reasoning on SFE
Loading...
44.06
Accuracy
GPT-5
24.6224
29.6687
34.715
39.7613
Apr 23, 2026
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
GPT-5
2026.04
44.06
Intern-S1
Parameter Count=235B+6B
2026.04
43.98
S1-VL-32B-RL
Parameter Count=32B, T...
2026.04
43.1
Gemini 2.5 Pro
2026.04
43
S1-VL-32B-SFT
Parameter Count=32B, T...
2026.04
42.58
Qwen3-VL-235B-A22B-Thinking
Parameter Count=235B-A...
2026.04
39.98
Gemini 2.5 Flash
2026.04
37.6
Qwen3-VL-32B-Thinking
Parameter Count=32B, R...
2026.04
37.5
Intern-S1-mini
Parameter Count=8B
2026.04
36.93
Thyme-VL
Parameter Count=7B
2026.04
25.37
Feedback
Search any
task
Search any
task