Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Jailbreak Safety Evaluation on English dataset Multi-Image
Loading...
14
StrongREJECT (Perturbed)
GPT-4o
0.48
3.99
7.5
11.01
Apr 13, 2025
StrongREJECT (Perturbed)
StrongREJECT (Unperturbed)
StrongREJECT (Cipher)
Updated 4d ago
Evaluation Results
Method
Method
Links
StrongREJECT (Perturbed)
StrongREJECT (Unperturbed)
StrongREJECT (Cipher)
GPT-4o
2025.04
14
13
20
Claude 3.5 Sonnet
2025.04
10
9
0
Gemini 1.5 Flash
2025.04
8
8
32
Gemini 1.5 Pro
2025.04
8
7
8
Claude 3 Haiku
2025.04
7
8
9
GPT-4o Mini
2025.04
1
1
20
Feedback
Search any
task
Search any
task