Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Content Moderation on Lexica UnsafeBench (test)

65Hate Safety Score

GGuard

23.434.24555.8Dec 22, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.12
6560.670.569.850.660.665.151.173.561.555.962
2025.12
2563.571.261.382.787.570.142.290.962.117.170.7