Secure Code Generation

Benchmarks

Dataset Name	SOTA Method	Metric
Python (test)	SPARK + LoRA	Safe Code Rate94.4	66	1mo ago
Java (evaluation)	LoRA	Safe Code Rate98.6	66	1mo ago
C++ (test)	LoRA	Safe Code Rate100	66	1mo ago
LLMSecEval 150 tasks	MA-CoT	Number of Vulnerabilities0	36	2mo ago
CWEval		pass@148.2	29	2mo ago
CWEval		Functionality92.27	22	4mo ago
CodeGuard+	Hybrid (CodeGuard + SCS)	Pass@185.93	18	4mo ago
CyberSecEval SCG	SafeCoder	Safety79.06	17	4mo ago
LLMSecEval	gpt-5	Total Vulnerabilities0	12	2mo ago
Primary Dataset	gemini-2.5	Total Vulnerabilities8	12	2mo ago
Secure Code Average	SecCoderX	Safety Score55.36	12	4mo ago
CYBERSECEVAL		Autocomplete79.38	8	23d ago
SecHolmesEval	P10 Hybrid Pipeline	Insecure Generation Rate1.9	8	4mo ago
SecLLMEval		Insecure Generation Rate2.7	8	4mo ago
Secure Code Generation Scenarios 1.0 (test)	gemini-2.5-pro (Reflex)	Security Success Rate0.971	8	4mo ago
Secure Code generation	BEAVER	RDR42	8	4mo ago
CVS (test)	Llama3-70b-instruct	C++ Success Rate98	8	4mo ago
COBALT Security Prompts 500 prompts per model		Vulnerability Rate48.4	7	3mo ago
SecurityEval Python	TSP	SPR@175.8	5	1mo ago
CyberSecEval Instruct	Mistral-7B (fine-tuned)	Secure Code Generation (%)86.01	2	3mo ago

Showing 20 of 20 rows