Scientific Multimodal Reasoning on ScienceOlympiad
[Chart: Accuracy by date. Top entry: GPT-5, 41.38 (Apr 23, 2026)]
Evaluation Results
Method                        Details                     Date      Accuracy
GPT-5                         -                           2026.04   41.38
Qwen3-VL-235B-A22B-Thinking   Parameter Count=235B-A...   2026.04   36.47
Intern-S1                     Parameter Count=235B+6B     2026.04   36.47
S1-VL-32B-RL                  Parameter Count=32B, T...   2026.04   33.5
S1-VL-32B-SFT                 Parameter Count=32B, T...   2026.04   33
Gemini 2.5 Flash              -                           2026.04   28.57
Gemini 2.5 Pro                -                           2026.04   24.13
Qwen3-VL-32B-Thinking         Parameter Count=32B, R...   2026.04   22.35
Intern-S1-mini                Parameter Count=8B          2026.04   11.82
Thyme-VL                      Parameter Count=7B          2026.04   1.97