Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Harmful prompts dataset

Benchmarks

Task NameDataset NameSOTA ResultTrend
Jailbreak attack success rateHarmful prompts dataset
Attack Success Rate97
49
Showing 1 of 1 rows