Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Agent Perturbation Reliability Testing on AgentHarm
Loading...
90.6
Accuracy
GPT-4o
80.928
83.439
85.95
88.461
Mar 5, 2026
Accuracy
Error Rate
FPR
FNR
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Error Rate
FPR
FNR
GPT-4o
2026.03
90.6
9.4
6.3
12.5
Llama 4 Maverick 17B
2026.03
90.6
9.4
6.3
12.5
Best Trio Ensemble
Ensemble Mode=Best Trio
2026.03
90.6
9.4
6.3
12.5
Claude Opus 4.5
2026.03
81.3
18.8
6.3
31.3
Gemini 2.5 Pro
2026.03
81.3
18.8
25
12.5
Feedback
Search any
task
Search any
task