Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Agent-SafetyBench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Agentic OversightAgent-SafetyBench
Detection Accuracy84.06
42
Agent Safety EvaluationAgent-SafetyBench aggregated clean and five attack types
UBR26.31
30
Agent Safety EvaluationAgent-SafetyBench
Agent-SafetyBench Score72.3
8
Showing 3 of 3 rows