Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Code Generation on HumanEval (pass@1)
Loading...
62.2
Pass@1
SDAR-4B-Chat
25.4152
34.9651
44.515
54.0649
Jan 30, 2026
Pass@1
Updated 25d ago
Evaluation Results
Method
Method
Links
Pass@1
SDAR-4B-Chat
Model=SDAR-4B-Chat, De...
2026.01
62.2
SDAR-4B-Chat
Model=SDAR-4B-Chat, De...
2026.01
60.37
SDAR-4B-Chat
Model=SDAR-4B-Chat, De...
2026.01
57.93
SDAR-1.7B-Chat
Model=SDAR-1.7B-Chat,...
2026.01
43.29
SDAR-1.7B-Chat
Model=SDAR-1.7B-Chat,...
2026.01
37.8
Qwen2.5-3B-Instruct
Model=Qwen2.5-3B-Instr...
2026.01
32.93
SDAR-1.7B-Chat
Model=SDAR-1.7B-Chat,...
2026.01
31.71
Llama3.2-3B-Instruct
Model=Llama3.2-3B-Inst...
2026.01
26.83
Feedback
Search any
task
Search any
task