Malicious Instruction Detection on MaliciousAgentSkillsBench (traditional IPI baselines)
[Chart: Precision, Recall, and F1 Score per method. Highlighted entry: RouteGuard, Precision 63.93, Apr 24, 2026.]
Evaluation Results

Method       Date      Precision   Recall   F1 Score
RouteGuard   2026.04   63.93       57.5     78.67
RENNERVATE   2026.04   47.76       55.28    70.17
ASen         2026.04   36.76       22.5     27.17
PArm         2026.04   24          20       21.2