Coding on LiveCodeBench (ACC, UT)
[Chart: Accuracy and Unit Test Success Rate over time, Jan 20, 2026 to Mar 10, 2026. Highlighted point: Qwen3-8B, Accuracy 46.7.]
Updated 1mo ago
Evaluation Results
Method                                  Date      Accuracy   Unit Test Success Rate
Qwen3-8B (Inference Mode=think)         2026.03   46.7       -
SSA-LLM-8B (Inference Mode=think)       2026.03   40.11      -
RAM                                     2026.01   31.96      47.72
DARE+TA                                 2026.01   31.95      46.69
RAM+                                    2026.01   31.6       46.84
WUDI                                    2026.01   30.04      44.48
Qwen3-8B (Inference Mode=no-think)      2026.03   23.08      -
SSA-LLM-8B (Inference Mode=no-think)    2026.03   19.78      -