Share your thoughts, 1 month free Claude Pro on us
See more
Feedback
Search any
task
Search any
task
SOTA Agent Security benchmarks and papers with code | Wizwand
Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Tasks
Agent Security
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
A3S-BENCH Advanced 1.0
Sonnet 4.5
RTR@1
19.68
11
12d ago
A3S-BENCH Basic 1.0
Qwen3.5-35B
RTR@1 (%)
45.27
11
12d ago
ASB (Agent Security Benchmark)
AttriGuard
No Attack UA
90
8
2mo ago
Showing 3 of 3 rows
25 / page
50 / page
100 / page
1
Search any
task
Search any
task