
Prompt Harmfulness Detection on AegisSafety (test)

74.8 F1 Score

MPNet-based NBF

Feb 28, 2025
Updated 4d ago

Evaluation Results

Date      F1 Score
2025.02   74.8
2025.02   74.1
2025.02   74
2025.02   31.9
2025.02   7.5