
violence prompts

Benchmarks

Task Name             Dataset Name      SOTA Result             Trend
Red-teaming           Violence prompts  Failure Rate (FR): 0    48
Jailbreak evaluation  violence prompts  Non-Unsafe Rate: 99.9   4