Code Generation on HumanEval, MBPP, HumanEval+, MBPP+, CRUX-O
[Chart: HumanEval Score. Highlighted point: HSA-UL-Inst, 71.95, Nov 28, 2025. Selectable metrics: HumanEval Score, MBPP Score, HumanEval+ Score, MBPP+ Score, CRUX-O Score.]
Updated 4d ago
Evaluation Results
Method       Details                    Date     HumanEval  MBPP  HumanEval+  MBPP+  CRUX-O
HSA-UL-Inst  Architecture=MoE, Tota...  2025.11  71.95      57    70.73       65.87  50.75
Qwen3-Inst   Architecture=Dense, To...  2025.11  65.24      51    61.59       59.52  50
Qwen3-Inst   Architecture=Dense, To...  2025.11  40.24      29.2  35.37       34.39  28
HSA-UL-Inst  Architecture=Dense, To...  2025.11  39.63      34.4  37.2        39.95  23.25