
Attack Success Rate Evaluation on HRL/LRL Safety Prompts Welsh Multi-Image v1

ASR: 0

Claude 3 Haiku

[Chart: ASR by date; x-axis label Apr 13, 2025]
Updated 4d ago

Evaluation Results

Method | Links