OR

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Toxicity Classification | OR | Harmonic F1: 49.7 | 26 |
| Over-refusal Compliance | OR Seemingly Toxic | Compliance Rate (Keyword Filter): 87 | 5 |