RTL – Code Improvement (Linting / QoR) on CVDP non-agentic 1.0

51.25 Pass@1

DeepSeek-R1

Updated 5mo ago

Evaluation Results

Method	Links	Pass@1
DeepSeek-R1 2025.12		51.25
Granite-4 2025.12		48.75
GPT-o4 Mini 2025.12		44.74
GPT-o4 Mini 2025.12		41
Nemotron-Mini 2025.12		36
SmolLM 2025.12		30
DeepSeek-R1 2025.12		21.25
Granite-4 2025.12		20.51
Nemotron-Mini 2025.12		20
SmolLM 2025.12		18.75