Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Binary Accuracy on SecGenEval-PS CodeAnalysis
Loading...
100
Accuracy
Qwen2.5-Coder-7B
38.64
54.57
70.5
86.43
Jan 10, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Qwen2.5-Coder-7B
Evaluation Mode=M1
2026.01
100
Qwen2.5-Coder-7B
Evaluation Mode=M3
2026.01
100
Qwen2.5-7B
Evaluation Mode=M1
2026.01
100
Qwen2.5-Coder-7B
Evaluation Mode=M2
2026.01
98.5
GPT-4o
Evaluation Mode=M2
2026.01
95.8
GPT-4o
Evaluation Mode=M3
2026.01
95.1
o3-mini
Evaluation Mode=M2
2026.01
94.4
o3-mini
Evaluation Mode=M3
2026.01
89.1
GPT-4o
Evaluation Mode=M1
2026.01
87.2
DeepSeek-R1-Distill-Qwen-7B
Evaluation Mode=M1
2026.01
57.1
o3-mini
Evaluation Mode=M1
2026.01
41
Feedback
Search any
task
Search any
task