Share your thoughts, 1 month free Claude Pro on usSee more

Python reference-model generation on RTLLM v2

77.3Pass@1

ChipMate-Python-9B

Updated 2mo ago

Evaluation Results

Method	Links
ChipMate-Python-9B 2026.05		77.3	81
ChipMate-Python-4B 2026.05		75.3	80.1
DeepSeek R1 2026.05		49.6	57.8
GPT-5.5 2026.05		48.5	55.4
DeepSeek V4 2026.05		44.4	54.3
CodeV-R1 2026.05		42.7	51.8
DeepSeek Coder 2026.05		42.4	50
Qwen3.5-9B 2026.05		40.1	51.8
Qwen3.5-4B 2026.05		38.2	46.1
CodeV-R1 (distill) 2026.05		36	46.2