Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Safety Evaluation on RealToxicityPrompts (test)

96Safety Score

Self-Improving Pretraining

86.74489.14791.5593.953Jan 29, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
96
2026.01
88.1
2026.01
87.1