RTL – Natural Language Specification to Code on CVDP non-agentic 1.0
[Chart: Pass@1 by date. Top result: GPT-o4 Mini, Pass@1 30.77, Dec 4, 2025. Updated 3mo ago.]
Evaluation Results
Method         Mode               Date     Pass@1
GPT-o4 Mini    single-shot        2025.12  30.77
GPT-o4 Mini    agentic framework  2025.12  17.95
Granite-4      single-shot        2025.12  6.41
Nemotron-Mini  single-shot        2025.12  1.28
Nemotron-Mini  agentic framework  2025.12  0
SmolLM         single-shot        2025.12  0
SmolLM         agentic framework  2025.12  0
DeepSeek-R1    single-shot        2025.12  0
DeepSeek-R1    agentic framework  2025.12  0
Granite-4      agentic framework  2025.12  0
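
The results above list most models twice, once per mode, so a per-model comparison may be easier to read than the raw ranking. The following is a minimal sketch that computes the single-shot minus agentic Pass@1 gap for each model. The scores are copied from the results; the names `RESULTS` and `mode_gap` are hypothetical and exist only for this illustration, not as part of any benchmark tooling.

```python
# Pass@1 scores copied from the results above, keyed by (model, mode).
RESULTS = {
    ("GPT-o4 Mini", "single-shot"): 30.77,
    ("GPT-o4 Mini", "agentic framework"): 17.95,
    ("Granite-4", "single-shot"): 6.41,
    ("Granite-4", "agentic framework"): 0.0,
    ("Nemotron-Mini", "single-shot"): 1.28,
    ("Nemotron-Mini", "agentic framework"): 0.0,
    ("SmolLM", "single-shot"): 0.0,
    ("SmolLM", "agentic framework"): 0.0,
    ("DeepSeek-R1", "single-shot"): 0.0,
    ("DeepSeek-R1", "agentic framework"): 0.0,
}


def mode_gap(results):
    """Return {model: single-shot Pass@1 minus agentic Pass@1}, rounded to 2 places.

    Assumes every model has an entry for both modes (true for the data above).
    """
    models = {model for model, _ in results}
    return {
        m: round(results[(m, "single-shot")] - results[(m, "agentic framework")], 2)
        for m in models
    }


if __name__ == "__main__":
    # Print models from the largest gap to the smallest.
    for model, gap in sorted(mode_gap(RESULTS).items(), key=lambda kv: -kv[1]):
        print(f"{model:14s} {gap:+.2f}")
```

In this data no model scores higher in the agentic framework than in single-shot mode; the gap is positive for three models and zero for the two that score 0 in both modes.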