Our new X account is live! Follow @wizwand_team for updates
Search any
task
Feedback
Search any
task
SOTA LLM Agent Defense benchmarks and papers with code | Wizwand
Our new X account is live! Follow @wizwand_team for updates
Home
/
Tasks
LLM Agent Defense
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
AgentDojo Overall
Repeat Prompt
Clean Utility
84.54
12
4d ago
AgentDojo Slack
No Defense
Clean Utility
80.95
12
4d ago
AgentDojo Workspace
Task Shield
Clean Utility
85
12
4d ago
Showing 3 of 3 rows
25 / page
50 / page
100 / page
1
Search any
task
Search any
task