Scientific Multimodal Reasoning on Galaxy-10
[Chart: Accuracy over time. Top result: GPT-5, 57.72 (Apr 23, 2026).]
Evaluation Results
| Method | Details | Date | Accuracy |
|---|---|---|---|
| GPT-5 | | 2026.04 | 57.72 |
| S1-VL-32B-RL | Parameter Count=32B, T... | 2026.04 | 51.52 |
| Intern-S1 | Parameter Count=235B+6B | 2026.04 | 48.37 |
| S1-VL-32B-SFT | Parameter Count=32B, T... | 2026.04 | 42.45 |
| Intern-S1-mini | Parameter Count=8B | 2026.04 | 39.06 |
| Qwen3-VL-235B-A22B-Thinking | Parameter Count=235B-A... | 2026.04 | 33.93 |
| Qwen3-VL-32B-Thinking | Parameter Count=32B, R... | 2026.04 | 30.44 |
| Thyme-VL | Parameter Count=7B | 2026.04 | 27.84 |
| Gemini 2.5 Pro | | 2026.04 | 25.14 |
| Gemini 2.5 Flash | | 2026.04 | 15.78 |