Knowledge on PHYBench
[Chart: Score and TPF by date. Highlighted point: LLaDA2.0-flash, Score 30.06, Feb 9, 2026]
Updated 4d ago
Evaluation Results
| Method | Setting | Date | Score | TPF |
|---|---|---|---|---|
| LLaDA2.0-flash | | 2026.02 | 30.06 | 270 |
| Qwen3-30B-A3B-Inst-2507 | | 2026.02 | 29.84 | 100 |
| LLaDA2.1-flash | Inference Mode=Q Mode | 2026.02 | 28.23 | 266 |
| Ling-flash-2.0 | | 2026.02 | 27.67 | 100 |
| LLaDA2.1-flash | Inference Mode=S Mode | 2026.02 | 26.04 | 410 |
| Ling-mini-2.0 | | 2026.02 | 14.59 | - |
| LLaDA2.1-mini | mode=Q Mode | 2026.02 | 13.05 | 0.0252 |
| LLaDA2.1-mini | mode=S Mode | 2026.02 | 12.75 | 0.0441 |
| LLaDA2.0-mini | | 2026.02 | 11.7 | 0.0248 |
| Qwen3-8B | no think=true | 2026.02 | 9.76 | - |