Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Kernel Generation on KernelBench Overall
Loading...
25
CR (Round 1)
Codex
4.2
9.6
15
20.4
Mar 11, 2026
CR (Round 1)
CR (Final)
Acc (Round 1)
Acc (Final)
Updated 1mo ago
Evaluation Results
Method
Method
Links
CR (Round 1)
CR (Final)
Acc (Round 1)
Acc (Final)
Codex
Model=GPT-5.2
2026.03
25
83
8
46
EvoKernel
Model=Qwen3-Coder-30B
2026.03
13
18
3
5.5
Pass@k
Model=GPT-5.2
2026.03
13
24.5
5
11
Pass@k
Model=Qwen3-Coder-30B
2026.03
11
16
3.5
4
Pass@k
Model=DeepSeek-V3.2
2026.03
11
23
3.5
4.5
EvoKernel
Model=GPT-5.2
2026.03
11
98.5
4
83
Refinement
Model=GPT-5.2
2026.03
10.5
71.5
4
22
Refinement
Model=DeepSeek-V3.2
2026.03
9
35
0
6
Refinement
Model=Qwen3-Coder-30B
2026.03
6.5
11.5
1
3
EvoKernel
Model=DeepSeek-V3.2
2026.03
5
29
1
9.5
Feedback
Search any
task
Search any
task