Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MaliciousGen & WildChat

Benchmarks

Task NameDataset NameSOTA ResultTrend
Safety EvaluationMaliciousGen & WildChat (test)
Rule Adherence97.69
8
Robust Safety and Utility Evaluation in Federated LearningMaliciousGen & WildChat
Rule Score81.35
8
Safety and Utility EvaluationMaliciousGen & WildChat
Rule Adherence97.69
3
Showing 3 of 3 rows