Coding on HEval+
Top result: QSUN, Accuracy 75
[Chart: Accuracy over time, Feb 27, 2026 to Mar 3, 2026]
Updated 1mo ago
Evaluation Results
| Method | Configuration | Date | Accuracy |
|---|---|---|---|
| QSUN | Model=Qwen3-8B-Base, #... | 2026.03 | 75 |
| Full-FT | Model=Qwen3-8B-Base, #... | 2026.03 | 74.3 |
| AWQ | Model=Qwen3-8B-Base, #... | 2026.03 | 72.6 |
| Full-FT | Model=LLaMA3.1-8B, # B... | 2026.03 | 45.7 |
| ICaRus | # Model=3, KV Sharing=O | 2026.02 | 43.9 |
| QSUN | Model=LLaMA3.1-8B, # B... | 2026.03 | 42.7 |
| Coding Model | # Model=1, KV Sharing=. | 2026.02 | 41.5 |
| Multi Model | # Model=3, KV Sharing=X | 2026.02 | 41.5 |
| AWQ | Model=LLaMA3.1-8B, # B... | 2026.03 | 39.6 |
| IF Model | # Model=1, KV Sharing=. | 2026.02 | 39 |
| Math Model | # Model=1, KV Sharing=. | 2026.02 | 36.6 |
| Base Model | # Model=1, KV Sharing=. | 2026.02 | 29.9 |