Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Refusal Evaluation on Harmful prompt suite (test)

95.5Refusal Rate

Base

-3.8221.96547.7573.535Jan 13, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
95.5
2026.01
93.8
2026.01
91.9
2026.01
84
2026.01
83.3
2026.01
42
2026.01
12
2026.01
2
2026.01
2
2026.01
0
2026.01
0
2026.01
0
2026.01
0
2026.01
0
2026.01
0