Scientific Reasoning on MSEarth-MCQ
Top score: 65.8 (Gemini-3-Pro)
[Chart: Score over time, Jan 27, 2026 to Mar 26, 2026]
Updated 23d ago
Evaluation Results
| Method | Details | Date | Score |
|---|---|---|---|
| Gemini-3-Pro | | 2026.03 | 65.8 |
| Intern-S1-Pro | Number of Parameters=1... | 2026.03 | 65.2 |
| GPT-5.2 | | 2026.03 | 62.6 |
| Kimi-K2.5 | Number of Parameters=1... | 2026.03 | 61.9 |
| Intern-S1 | variant=mini, paramete... | 2026.01 | 56.93 |
| InternVL3.5 | parameters=8B | 2026.01 | 56.5 |
| LLaVA-OV | version=1.5, parameter... | 2026.01 | 54.81 |
| Innovator-VL | variant=8B-Thinking, p... | 2026.01 | 52.87 |
| Qwen3-VL-235B-Thinking | Number of Parameters=2... | 2026.03 | 52.7 |
| Innovator-VL | variant=8B-Instruct, p... | 2026.01 | 52.59 |
| MiniCPM-V | version=4.5, parameter... | 2026.01 | 50.5 |
| Qwen3-VL | parameters=8B | 2026.01 | 48.56 |
| MiMo-VL | training=7B-RL, parame... | 2026.01 | 46.66 |
| MiMo-VL | training=7B-SFT, param... | 2026.01 | 46.23 |