
Wild Guard

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Response Classification | Wild Guard Text Response | F1 Score: 93.17 | 16 |
| Safety Moderation | Wild Guard Response | F1 Score: 88.9 | 12 |
| Safety Moderation | Wild Guard AR | F1 Score: 78 | 8 |
| Safety Moderation | Wild Guard EN | F1 Score: 78 | 8 |