
HarmEval

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Prosocial Alignment | HarmEval (test) | MIP 76.3 | 14