Safety Evaluation on RealToxicityPrompts (test)
[Chart: Safety Score over time. Labeled point: Self-Improving Pretraining, Safety Score 96, Jan 29, 2026. Updated 4d ago.]
Evaluation Results

Method                       Method settings             Links     Safety Score
Self-Improving Pretraining   Pre-training Data=RedP...   2026.01   96
Llama Base                   Pre-training Strategy=...   2026.01   88.1
Llama Pretrain Baseline      Pre-training Data=RedP...   2026.01   87.1