Text-inject Detection (benign task-switch) on BIPIA email
[Chart: FPR, Metric Value (100 Scale). Plotted point: LCF (Qwen-2.5-7B), FPR 14, Apr 27, 2026. Selectable metrics: FPR, TPR, TPR - FPR Difference, Cohen's d.]
Updated 1mo ago
Evaluation Results
| Method | Configuration | Date | FPR | TPR | TPR - FPR Difference | Cohen's d | Metric Value (100 Scale) |
|---|---|---|---|---|---|---|---|
| LCF (Qwen-2.5-7B) | Backbone=Qwen-2.5-7B,... | 2026.04 | 14 | 100 | 86 | 1.45 | - |
| LCF (Gemma-2-9B) | Backbone=Gemma-2-9B, E... | 2026.04 | 14 | 100 | 86 | 3.05 | - |
| LCF (Llama-3-8B) | Backbone=Llama-3-8B, E... | 2026.04 | 15 | 100 | 85 | 2.33 | - |
| LCF (Qwen-2.5-14B) | Backbone=Qwen-2.5-14B,... | 2026.04 | 15 | 100 | 85 | 1.41 | - |
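The page does not say how the reported metrics were computed. As a rough illustration only, the sketch below assumes the standard definitions: TPR and FPR as percentages over binary detector decisions, their difference as TPR minus FPR, and Cohen's d as the difference in mean detector scores divided by the pooled standard deviation. The function names `rates` and `cohens_d` are hypothetical, and the toy counts are made up to land on the rounded values in the first row; they are not the benchmark's data or sample sizes.

```python
from math import sqrt
from statistics import mean, variance


def rates(labels, preds):
    """Return (TPR, FPR) in percent. Label/prediction 1 means 'injection flagged'."""
    tp = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 1)
    fn = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 0)
    fp = sum(1 for y, p in zip(labels, preds) if y == 0 and p == 1)
    tn = sum(1 for y, p in zip(labels, preds) if y == 0 and p == 0)
    tpr = 100.0 * tp / (tp + fn)
    fpr = 100.0 * fp / (fp + tn)
    return tpr, fpr


def cohens_d(pos_scores, neg_scores):
    """Cohen's d with pooled sample standard deviation (assumed definition)."""
    n1, n2 = len(pos_scores), len(neg_scores)
    pooled_var = ((n1 - 1) * variance(pos_scores)
                  + (n2 - 1) * variance(neg_scores)) / (n1 + n2 - 2)
    return (mean(pos_scores) - mean(neg_scores)) / sqrt(pooled_var)


# Toy data (assumption): 50 injected emails, all flagged;
# 50 benign task-switch emails, 7 wrongly flagged.
labels = [1] * 50 + [0] * 50
preds = [1] * 50 + [1] * 7 + [0] * 43

tpr, fpr = rates(labels, preds)
print(f"TPR={tpr:.0f}  FPR={fpr:.0f}  TPR-FPR={tpr - fpr:.0f}")
```

Under these definitions a method can share the same TPR and FPR as another yet differ in Cohen's d, since d depends on the spread of the underlying scores rather than on thresholded decisions.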