Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Secure LLM Agent Task Completion on AgentDojo
Loading...
76.3
Benign Utility
Progent
53.7944
59.6372
65.48
71.3228
Jun 13, 2025
Benign Utility
Attacked Utility
Attack Success Rate (ASR)
Updated 23d ago
Evaluation Results
Method
Method
Links
Benign Utility
Attacked Utility
Attack Success Rate (ASR)
Progent
Base Model=GPT-4o
2025.06
76.3
61.2
2.2
DRIFT
Base Model=GPT-4o
2025.06
71.61
62
1.66
DRIFT
Base Model=GPT-4o-mini
2025.06
57.29
50.93
1.35
Progent
Base Model=GPT-4o-mini
2025.06
54.66
45.58
9.39
Feedback
Search any
task
Search any
task