Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Agent Authorization Enforcement on AGENTDOJO Overall
Loading...
0
# False Negatives
PAUTH
-0.001
-0.0005
0
0.0005
Mar 17, 2026
# False Negatives
# Injection Runs
# False Positives
# Benign Runs
Updated 1mo ago
Evaluation Results
Method
Method
Links
# False Negatives
# Injection Runs
# False Positives
# Benign Runs
PAUTH
LLM Backbone=GPT-4.1,...
2026.03
0
634
0
100
Feedback
Search any
task
Search any
task