Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Jailbreak Safety Evaluation on English dataset Multi-Image

14StrongREJECT (Perturbed)

GPT-4o

0.483.997.511.01Apr 13, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.04
141320
1090
8832
878
789
1120