Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Physics-Scene Visual Reasoning on Physics
Loading...
54.35
Accuracy
S1-VL-32B-RL
10.7324
22.0562
33.38
44.7038
Apr 23, 2026
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
S1-VL-32B-RL
Parameter Count=32B, T...
2026.04
54.35
GPT-5
2026.04
48.34
Qwen3-VL-235B-A22B-Thinking
Parameter Count=235B-A...
2026.04
46.03
Intern-S1
Parameter Count=235B+6B
2026.04
44.95
S1-VL-32B-SFT
Parameter Count=32B, T...
2026.04
43.8
Qwen3-VL-32B-Thinking
Parameter Count=32B, R...
2026.04
41.71
Gemini 2.5 Pro
2026.04
40
Gemini 2.5 Flash
2026.04
37.1
Intern-S1-mini
Parameter Count=8B
2026.04
30.53
Thyme-VL
Parameter Count=7B
2026.04
12.41
Feedback
Search any
task
Search any
task