Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Scientific Multimodal Reasoning on VRSBench
Loading...
74.32
Accuracy
S1-VL-32B-RL
54.1128
59.3589
64.605
69.8511
Apr 23, 2026
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
S1-VL-32B-RL
Parameter Count=32B, T...
2026.04
74.32
S1-VL-32B-SFT
Parameter Count=32B, T...
2026.04
72.34
Qwen3-VL-235B-A22B-Thinking
Parameter Count=235B-A...
2026.04
68.94
Qwen3-VL-32B-Thinking
Parameter Count=32B, R...
2026.04
68.41
GPT-5
2026.04
65.89
Gemini 2.5 Pro
2026.04
65.7
Gemini 2.5 Flash
2026.04
64.23
Intern-S1
Parameter Count=235B+6B
2026.04
63.48
Thyme-VL
Parameter Count=7B
2026.04
57.61
Intern-S1-mini
Parameter Count=8B
2026.04
54.89
Feedback
Search any
task
Search any
task