Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Attack Success Rate Evaluation on HRL/LRL Safety Prompts English Multi-Image v1

2ASR

GPT-4o Mini

0.610.0519.528.95Apr 13, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2
25
26
27
30
2025.04
37