
SafeDialBench

Benchmarks

Task Name                    Dataset Name           SOTA Result               Trend
Safety Dialogue Evaluation   SafeDialBench          Score: 8.42               33
Safety Detection             SafeDialBench (full)   Recall: 99                12
Safety dialogue evaluation   SafeDialBench          Normalized Score: 61.33   5
Unsafe-input detection       SafeDialBench EN       Recall: 99.07             2