Prompt Injection Detection on QualifirePI
[Figure: F1 Score over time; top result Apriel Guard, F1 Score 87 (Dec 23, 2025). Updated 1mo ago.]
Evaluation Results
Method                Configuration              Date     F1 Score
Apriel Guard          Model Size=8B, Reasoni...  2025.12  87
Apriel Guard          Model Size=8B, Reasoni...  2025.12  85
IBM Granite Guardian  Version=3.1, Model Siz...  2025.12  79
IBM Granite Guardian  Version=3.2, Model Siz...  2025.12  78
IBM Granite Guardian  Version=3.2, Model Siz...  2025.12  78
IBM Granite Guardian  Version=3.3, Model Siz...  2025.12  77
Llama Prompt Guard 2  Model Size=0.086B, Rea...  2025.12  68
IBM Granite Guardian  Version=3.3, Model Siz...  2025.12  68
Llama Guard           Version=3, Reasoning=f...  2025.12  59
Llama Guard           Version=4, Reasoning=f...  2025.12  59
Llama Guard           Version=2, Reasoning=f...  2025.12  42
gpt-oss-safeguard     Model Size=20B, Reason...  2025.12  42
ShieldGemma           Model Size=9B, Reasoni...  2025.12  35
Qwen3Guard            Model Size=8B, Constra...  2025.12  25
Qwen3Guard            Model Size=8B, Constra...  2025.12  10