Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Physics Reasoning on OlyBench Phy
Loading...
4.5
Acceptance Length
TTS
1.068
1.959
2.85
3.741
May 10, 2026
Acceptance Length
Delta (%)
Updated 22d ago
Evaluation Results
Method
Method
Links
Acceptance Length
Delta (%)
TTS
Target Model=Qwen/Qwen...
2026.05
4.5
48.1
TTS
Target Model=Qwen/Qwen...
2026.05
4.3
53.1
TTS
Target Model=Qwen/Qwen...
2026.05
3.9
28.4
DFlash
Target Model=Qwen/Qwen...
2026.05
3.1
-
DFlash
Target Model=Qwen/Qwen...
2026.05
3
-
DFlash
Target Model=Qwen/Qwen...
2026.05
2.8
-
TTS
Model=Llama3.1-8B
2026.05
1.9
65.4
TTS
Model=Qwen/Qwen3-8B
2026.05
1.9
17.2
TTS
Model=Qwen/Qwen3-32B
2026.05
1.8
21.6
EAGLE-3
Model=Qwen/Qwen3-8B
2026.05
1.6
-
EAGLE-3
Model=Qwen/Qwen3-32B
2026.05
1.5
-
EAGLE-3
Model=Llama3.1-8B
2026.05
1.2
-
Feedback
Search any
task
Search any
task