Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Prompt Injection Detection on QualifirePI
Loading...
87
F1 Score
Apriel Guard
6.92
27.71
48.5
69.29
Dec 23, 2025
F1 Score
Updated 4d ago
Evaluation Results
Method
Method
Links
F1 Score
Apriel Guard
Model Size=8B, Reasoni...
2025.12
87
Apriel Guard
Model Size=8B, Reasoni...
2025.12
85
IBM Granite Guardian
Version=3.1, Model Siz...
2025.12
79
IBM Granite Guardian
Version=3.2, Model Siz...
2025.12
78
IBM Granite Guardian
Version=3.2, Model Siz...
2025.12
78
IBM Granite Guardian
Version=3.3, Model Siz...
2025.12
77
Llama Prompt Guard 2
Model Size=0.086B, Rea...
2025.12
68
IBM Granite Guardian
Version=3.3, Model Siz...
2025.12
68
Llama Guard
Version=3, Reasoning=f...
2025.12
59
Llama Guard
Version=4, Reasoning=f...
2025.12
59
Llama Guard
Version=2, Reasoning=f...
2025.12
42
gpt-oss-safeguard
Model Size=20B, Reason...
2025.12
42
ShieldGemma
Model Size=9B, Reasoni...
2025.12
35
Qwen3Guard
Model Size=8B, Constra...
2025.12
25
Qwen3Guard
Model Size=8B, Constra...
2025.12
10
Feedback
Search any
task
Search any
task