Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SafeDialBench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Safety Dialogue EvaluationSafeDialBench
Score8.42
33
Safety DetectionSafeDialBench (full)
Recall99
12
Jailbreak DetectionSafeDialBench
AUC0.912
9
Safety dialogue evaluationSafeDialBench
Normalized Score61.33
5
Unsafe-input detectionSafeDialBench EN
Recall99.07
2
Showing 5 of 5 rows