Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Semantic Alignment on Safety Concepts Smoking
Loading...
-0.0996
Delta Original
Undefended Model
-0.10458
-0.10209
-0.0996
-0.09711
Feb 22, 2026
Delta Original
Delta System
Gain
Unsafe Content Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Delta Original
Delta System
Gain
Unsafe Content Rate
Undefended Model
2026.02
-0.0996
-
-
-
ReVision
2026.02
-
-0.0902
0.0094
0.0057
Feedback
Search any
task
Search any
task