Code Completion on CVDP non-agentic 1.0
[Chart: Pass@1; best result 24.47, GPT-o4 Mini, Dec 4, 2025]
Updated 3mo ago
Evaluation Results
Method         Mode               Date     Pass@1
GPT-o4 Mini    agentic framework  2025.12  24.47
GPT-o4 Mini    single-shot        2025.12  17.02
Granite-4      single-shot        2025.12  9.57
Nemotron-Mini  single-shot        2025.12  4.26
SmolLM         single-shot        2025.12  1.03
SmolLM         agentic framework  2025.12  1.03
Nemotron-Mini  agentic framework  2025.12  0
DeepSeek-R1    single-shot        2025.12  0
DeepSeek-R1    agentic framework  2025.12  0
Granite-4      agentic framework  2025.12  0