Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Attack Success Rate Evaluation on HRL/LRL Safety Prompts Welsh Multi-Image v1

0ASR

Claude 3 Haiku

-0.00240.01380.030.0462Apr 13, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
0
0
0.02
0.02
0.04
2025.04
0.06