Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Refusal Evaluation on HarmBench

71Baseline Performance

LLaMA-3.1 8B

67.4569.2257172.775Feb 11, 2026
Updated 4d ago

Evaluation Results

MethodLinks
71--
2026.02
-6075.36