Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Safety Evaluation on ActorAttack

3.5ASR

GPT-4o

1.2416.49531.7547.005Feb 28, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.02
3.5
2025.02
4
2025.02
4
2025.02
9
2025.02
20
2025.02
51
2025.02
58.5
2025.02
60