Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Semantic Alignment on Safety Concepts Multi-Concept

-0.0396Delta Original

Undefended Model

-0.04158-0.04059-0.0396-0.03861Feb 22, 2026
Updated 4d ago

Evaluation Results

MethodLinks
-0.0396---
2026.02
-0.01980.05930.0458