Math on GSM-Plus
[Chart: Score and TPF by date. Highlighted point: LLaDA2.0-flash, Score 89.74, Feb 9, 2026]
Updated 4d ago
Evaluation Results
| Method | Setting | Date | Score | TPF |
|---|---|---|---|---|
| LLaDA2.0-flash | | 2026.02 | 89.74 | 268 |
| Ling-flash-2.0 | | 2026.02 | 89.71 | 100 |
| LLaDA2.1-flash | Inference Mode=Q Mode | 2026.02 | 89.69 | 383 |
| Qwen3-30B-A3B-Inst-2507 | | 2026.02 | 89.41 | 100 |
| LLaDA2.1-flash | Inference Mode=S Mode | 2026.02 | 89.23 | 714 |
| Ling-mini-2.0 | | 2026.02 | 87.18 | - |
| LLaDA2.1-mini | mode=Q Mode | 2026.02 | 86.55 | 3.69 |
| LLaDA2.0-mini | | 2026.02 | 86.5 | 2.41 |
| LLaDA2.1-mini | mode=S Mode | 2026.02 | 85.88 | 6.82 |
| Qwen3-8B | no think=true | 2026.02 | 85.56 | - |