Share your thoughts, 1 month free Claude Pro on us
See more
Feedback
Search any
task
Search any
task
SOTA Code benchmarks and papers with code | Wizwand
Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Tasks
Code
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
HumanEval
Qwen3-Next-80B-A3B
HumanEval Accuracy
95.1
79
3d ago
MBPP
PrivCode
Pass@1
77.9
49
11d ago
HumanEval+
SUN
Accuracy
79.9
34
1mo ago
LiveCodeBench V5-6
Reasoning Memory
Accuracy
50.8
33
15d ago
LiveCodeBench V1-4
Reasoning Memory
Accuracy
47.1
33
15d ago
HumanEval (test)
LightMoE
HumanEval Success Rate
58.1
14
1mo ago
LCB v6
GPT-5.2
Score
87.7
11
22d ago
MBPP 1,000-example (test)
Qwen3-VL-2B-Instruct
Perplexity
9.0212
10
1mo ago
HumanEval pass@1
CodePref
Pass@1
67.07
9
1mo ago
SWE Verified Agentless
DeepSeek-R1 0528 671B
pass@1
57.6
8
1mo ago
LCB Pro Med 25Q2
Nemotron-Cascade 14B-Thinking
pass@1
10.5
7
1mo ago
LCB Pro Easy 25Q2
Nemotron-Cascade 14B-Thinking
Pass@1
68.9
7
1mo ago
LCB 08/24-02/25 v5
Nemotron-Cascade 14B-Thinking
pass@1
77.5
7
1mo ago
CRUX
Qwen2.5-14B-Instruct-1M
Accuracy
66.4
6
1mo ago
APT-Bench
Qwen3
Accuracy
41.9
6
1mo ago
MBPP+
AdaRAS
Accuracy
60.58
6
1mo ago
GRAFITE Sample Code
Llama-4-Maverick-17B-128E-Instruct
Pass Rate
100
4
29d ago
EnConda-Bench
Youtu-LLM 2B
Accuracy
0.215
4
1mo ago
CruxEval o
Engram-40B
Exact Match
35.3
4
1mo ago
CruxEval-i
Engram-40B
Exact Match
36.2
4
1mo ago
MBPP Sanitized
N-3-Super 120B-A12B-Base
pass@1
78.38
3
3d ago
LiveCodeBench EN
Qwen3-8B
Score
63.39
2
1mo ago
Showing 22 of 22 rows
25 / page
50 / page
100 / page
1
Search any
task
Search any
task
Privacy Policy
Terms of Service
FAQs
Swarm Docs