Agent Authorization Enforcement on AGENTDOJO Workspace suite
[Chart: False Negatives (#FN) for PAUTH, data point dated Mar 17, 2026. Selectable metrics: False Negatives (#FN), Injection Attempts Count, False Positives (#FP), Benign Attempts Count.]
Updated 1mo ago
Evaluation Results

Method: PAUTH
Method details: LLM Backbone=GPT-4.1,...
Date: 2026.03
False Negatives (#FN): 0
Injection Attempts Count: 205
False Positives (#FP): 0
Benign Attempts Count: 40
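
The counts reported above can be turned into rates for easier comparison across methods. The sketch below is only an illustration: it assumes the false-negative rate is #FN divided by the Injection Attempts Count and the false-positive rate is #FP divided by the Benign Attempts Count, which the page does not state explicitly. The helper name `rate` is hypothetical.

```python
def rate(errors: int, attempts: int) -> float:
    """Return errors / attempts, or 0.0 when there were no attempts."""
    if errors < 0 or attempts < 0 or errors > attempts:
        raise ValueError("counts must satisfy 0 <= errors <= attempts")
    return errors / attempts if attempts else 0.0


# Counts reported for PAUTH on the AGENTDOJO Workspace suite.
fn, injection_attempts = 0, 205
fp, benign_attempts = 0, 40

# Assumed definitions (not stated on the page):
#   FN rate = #FN / injection attempts, FP rate = #FP / benign attempts.
fn_rate = rate(fn, injection_attempts)
fp_rate = rate(fp, benign_attempts)

print(f"FN rate: {fn_rate:.1%}, FP rate: {fp_rate:.1%}")
```

With zero errors in both columns, both rates come out to 0.0% under these assumed definitions.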