Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Agent Safety Evaluation on SafeToolBench

85.7UAR

Prompt-only safety

-3.42819.71142.8565.989May 18, 2026
Updated 14d ago

Evaluation Results

MethodLinks
2026.05
85.7--
2026.05
0--