Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Coding on CRUXEval-O
Loading...
87.5
Score
LLaDA2.1-flash
69.9448
74.5024
79.06
83.6176
Feb 9, 2026
Score
TPF
Updated 4d ago
Evaluation Results
Method
Method
Links
Score
TPF
LLaDA2.1-flash
Inference Mode=Q Mode
2026.02
87.5
380
Qwen3-30B-A3B-Inst-2507
2026.02
86.75
100
LLaDA2.1-flash
Inference Mode=S Mode
2026.02
85.25
654
LLaDA2.0-flash
2026.02
85.12
321
Ling-flash-2.0
2026.02
82.75
100
Ling-mini-2.0
2026.02
76.12
-
Qwen3-8B
no think=true
2026.02
74.06
-
LLaDA2.1-mini
mode=Q Mode
2026.02
73.75
3.35
LLaDA2.0-mini
2026.02
71.62
2.78
LLaDA2.1-mini
mode=S Mode
2026.02
70.62
5.85
Feedback
Search any
task
Search any
task