Coding on HEval
[Figure: Accuracy over time, Feb 27, 2026 to Mar 3, 2026. Top result: Full-FT, 83.5 Accuracy.]
Updated 1mo ago
Evaluation Results
Method        Details                      Date     Accuracy
Full-FT       Model=Qwen3-8B-Base, #...    2026.03  83.5
QSUN          Model=Qwen3-8B-Base, #...    2026.03  82.3
AWQ           Model=Qwen3-8B-Base, #...    2026.03  79.3
Full-FT       Model=LLaMA3.1-8B, # B...    2026.03  48.2
Coding Model  # Model=1, KV Sharing=.      2026.02  48.2
Multi Model   # Model=3, KV Sharing=X      2026.02  48.2
ICaRus        # Model=3, KV Sharing=O      2026.02  48.2
QSUN          Model=LLaMA3.1-8B, # B...    2026.03  47.6
AWQ           Model=LLaMA3.1-8B, # B...    2026.03  45.7
IF Model      # Model=1, KV Sharing=.      2026.02  44.5
Math Model    # Model=1, KV Sharing=.      2026.02  42.7
Base Model    # Model=1, KV Sharing=.      2026.02  36.6