Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Attack Success Rate Evaluation on HRL/LRL Safety Prompts Welsh Text v1
Loading...
0
Attack Success Rate
Gemini 1.5 Flash
-0.96
5.52
12
18.48
Apr 13, 2025
Attack Success Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Attack Success Rate
Gemini 1.5 Flash
2025.04
0
Gemini 1.5 Pro
2025.04
0
Claude 3.5 Sonnet
2025.04
2
Claude 3 Haiku
2025.04
6
GPT-4o Mini
2025.04
18
GPT-4o
2025.04
24
Feedback
Search any
task
Search any
task