Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Harmful safety datasets

Benchmarks

Task NameDataset NameSOTA ResultTrend
Input ModerationHarmful safety datasets Average
Average F1 Score (Input Moderation)88.33
9
Showing 1 of 1 rows