Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Toxicity Classification on Toxicity
Loading...
90.4
Original Accuracy
AdvDemo + CW
85.88
88.14
90.4
92.66
Jan 29, 2026
Original Accuracy
AD
ASRR (Fake)
ASRR (Temp)
ASRR (Needle)
Updated 4d ago
Evaluation Results
Method
Method
Links
Original Accuracy
AD
ASRR (Fake)
ASRR (Temp)
ASRR (Needle)
AdvDemo + CW
Recipe Code=p10_CWmess...
2026.01
90.4
3.8
61.6
80
63.6
AdvDemo + Random Template
Recipe Code=p10_length10
2026.01
90.4
7
9.8
0
94.8
AdvDemo + CW + Random Template
Variant=Toxicity I, Re...
2026.01
90.4
11.6
57.2
9,360
94.8
AdvDemo + CW + Random Template
Variant=Toxicity II, R...
2026.01
90.4
4.2
53.4
0
94.8
Feedback
Search any
task
Search any
task