Attack Success Rate Evaluation on HRL/LRL Safety Prompts English, Single Image v1
[Chart: ASR over time. Data point shown: Claude 3.5 Sonnet, Apr 13, 2025.]
Evaluation Results
Method               Date       ASR
Claude 3.5 Sonnet    2025.04    0
GPT-4o Mini          2025.04    0.03
Claude 3 Haiku       2025.04    0.1
GPT-4o               2025.04    0.13
Gemini 1.5 Pro       2025.04    0.23
Gemini 1.5 Flash     2025.04    0.72