Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Physics Reasoning on Synthetic Symbolic
Loading...
10.4
Accuracy
Qwen2.5-32B + RL (synthetic)
2.912
4.856
6.8
8.744
Apr 13, 2026
Accuracy
Updated 5d ago
Evaluation Results
Method
Method
Links
Accuracy
Qwen2.5-32B + RL (synthetic)
Model=Qwen2.5-32B, Pos...
2026.04
10.4
Qwen2.5-14B + RL (synthetic)
Model=Qwen2.5-14B, Pos...
2026.04
10.4
Qwen2.5-7B + RL (synthetic)
Model=Qwen2.5-7B, Post...
2026.04
9.6
Qwen2.5-3B + RL (synthetic)
Model=Qwen2.5-3B, Post...
2026.04
9.4
Qwen3-30B
Model=Qwen3-30B, Post-...
2026.04
8.8
Qwen3-30B + RL (synthetic)
Model=Qwen3-30B, Post-...
2026.04
8
Qwen2.5-32B
Model=Qwen2.5-32B, Pos...
2026.04
5.6
Qwen2.5-14B
Model=Qwen2.5-14B, Pos...
2026.04
5.6
Qwen2.5-7B
Model=Qwen2.5-7B, Post...
2026.04
5.6
Qwen2.5-3B
Model=Qwen2.5-3B, Post...
2026.04
3.2
Feedback
Search any
task
Search any
task