Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Jailbreak Defense Evaluation on EMRA HQ
Loading...
0
ASR
SecurityLingua
-0.372
2.139
4.65
7.161
Apr 5, 2026
ASR
Updated 12d ago
Evaluation Results
Method
Method
Links
ASR
SecurityLingua
Target LLM=GPT-5
2026.04
0
CoopGuard
Target LLM=GPT-5
2026.04
0
GoalPriority
Target LLM=Gemini-2.5-Pro
2026.04
0
CoopGuard
Target LLM=Gemini-2.5-Pro
2026.04
0
Self-Reminder
Target LLM=DeepSeek-V3
2026.04
0
CoopGuard
Target LLM=DeepSeek-V3
2026.04
0
RPO
Target LLM=GPT-5
2026.04
0.3
Self-Reminder
Target LLM=Gemini-2.5-Pro
2026.04
0.3
GoalPriority
Target LLM=DeepSeek-V3
2026.04
0.6
GoalPriority
Target LLM=GPT-5
2026.04
1
Self-Reminder
Target LLM=GPT-5
2026.04
1.1
PAT
Target LLM=GPT-5
2026.04
1.3
PAT
Target LLM=Gemini-2.5-Pro
2026.04
2.1
RPO
Target LLM=Gemini-2.5-Pro
2026.04
2.1
PAT
Target LLM=DeepSeek-V3
2026.04
3.1
RPO
Target LLM=DeepSeek-V3
2026.04
4.3
SecurityLingua
Target LLM=Gemini-2.5-Pro
2026.04
9.3
SecurityLingua
Target LLM=DeepSeek-V3
2026.04
9.3
Feedback
Search any
task
Search any
task