Code-inject detection (malicious code) on BIPIA code-QA
[Chart: TPR by date. Plotted entry: LCF (Qwen-2.5-14B), Apr 27, 2026. Selectable metrics: TPR, TPR - FPR Difference, Cohen's d, Success Rate (>0).]
Updated 1mo ago
Evaluation Results
Method               Details                     Date      TPR   TPR - FPR Difference   Cohen's d   Success Rate (>0)
LCF (Qwen-2.5-14B)   Backbone=Qwen-2.5-14B,...   2026.04   100   81                     2.18        99
LCF (Gemma-2-9B)     Backbone=Gemma-2-9B, E...   2026.04   91    77                     1.41        95
LCF (Qwen-2.5-7B)    Backbone=Qwen-2.5-7B,...    2026.04   88    75                     1.64        95
LCF (Llama-3-8B)     Backbone=Llama-3-8B, E...   2026.04   43    26                     0.86        83
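
The page does not define how the reported metrics are computed. As a rough illustration only, the sketch below shows one common way to obtain TPR, FPR, their difference, and Cohen's d from per-sample detector scores. The function names (`detection_metrics`, `implied_fpr`), the thresholding rule, and the pooled-standard-deviation form of Cohen's d are all assumptions, not the benchmark's documented procedure. Success Rate (>0) is left out because its definition is not given here.

```python
from math import sqrt
from statistics import mean, stdev


def detection_metrics(scores_pos, scores_neg, threshold):
    """Hypothetical helper: summarize a detector from raw scores.

    scores_pos: scores on samples that contain injected code
    scores_neg: scores on clean samples
    A sample is flagged when its score is >= threshold (assumed rule).
    """
    # Rates are expressed as percentages, matching the scale of the table.
    tpr = 100.0 * sum(s >= threshold for s in scores_pos) / len(scores_pos)
    fpr = 100.0 * sum(s >= threshold for s in scores_neg) / len(scores_neg)

    # Cohen's d with a pooled standard deviation (one common convention).
    n1, n2 = len(scores_pos), len(scores_neg)
    pooled_var = (
        (n1 - 1) * stdev(scores_pos) ** 2 + (n2 - 1) * stdev(scores_neg) ** 2
    ) / (n1 + n2 - 2)
    cohens_d = (mean(scores_pos) - mean(scores_neg)) / sqrt(pooled_var)

    return {
        "tpr": tpr,
        "fpr": fpr,
        "tpr_minus_fpr": tpr - fpr,
        "cohens_d": cohens_d,
    }


def implied_fpr(tpr, tpr_minus_fpr):
    """FPR implied by a reported TPR and TPR - FPR difference,
    assuming both are on the same percentage scale."""
    return tpr - tpr_minus_fpr
```

Under the assumption that the difference column is simply TPR minus FPR on the same scale, `implied_fpr(100, 81)` gives 19 for LCF (Qwen-2.5-14B) and `implied_fpr(43, 26)` gives 17 for LCF (Llama-3-8B). If that reading is right, the backbones flag clean samples at broadly similar rates and differ mainly in TPR.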