Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Safety Guardrail Classification on 630-scenario real-world benchmark (independent set)

95.4Verdict Accuracy

AgentTrust v0.5

35.651.12566.6582.175May 6, 2026
Updated 27d ago

Evaluation Results

MethodLinks
2026.05
95.42.16.42.04
2026.05
90.54.80.38.6
2026.05
85.13.21.71,271
2026.05
55.198.404,315
2026.05
37.9085.20.03