Share your thoughts, 1 month free Claude Pro on usSee more

Binary Accuracy on SecGenEval-PS CodeAnalysis

100Accuracy

Qwen2.5-Coder-7B

Updated 5mo ago

Evaluation Results

Method	Links
Qwen2.5-Coder-7B 2026.01		100
Qwen2.5-Coder-7B 2026.01		100
Qwen2.5-7B 2026.01		100
Qwen2.5-Coder-7B 2026.01		98.5
GPT-4o 2026.01		95.8
GPT-4o 2026.01		95.1
o3-mini 2026.01		94.4
o3-mini 2026.01		89.1
GPT-4o 2026.01		87.2
DeepSeek-R1-Distill-Qwen-7B 2026.01		57.1
o3-mini 2026.01		41