Safety Evaluation

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Safety Evaluation | Safety Evaluation | Overall Safety Score: 73.97 | 9 |
| Safety Evaluation | Safety Evaluation | Illegal Content Count: 0 | 3 |
| Safety Detection | Safety Evaluation Strict Safety Mode | Precision: 50 | 1 |