Jailbreak Defense Evaluation on EMRA RQ
Best ASR: 2.3 (CoopGuard)
[Chart: ASR by date; axis label Apr 5, 2026]
Updated 12d ago
Evaluation Results
Method          Target LLM      Date     ASR
CoopGuard       GPT-5           2026.04   2.3
CoopGuard       Gemini-2.5-Pro  2026.04   6.3
GoalPriority    GPT-5           2026.04   7.2
GoalPriority    Gemini-2.5-Pro  2026.04   7.5
Self-Reminder   DeepSeek-V3     2026.04   7.9
Self-Reminder   Gemini-2.5-Pro  2026.04   8.9
Self-Reminder   GPT-5           2026.04  11.4
CoopGuard       DeepSeek-V3     2026.04  13.8
PAT             DeepSeek-V3     2026.04  14.8
GoalPriority    DeepSeek-V3     2026.04  15.3
PAT             Gemini-2.5-Pro  2026.04  16.6
RPO             Gemini-2.5-Pro  2026.04  16.8
RPO             DeepSeek-V3     2026.04  17.0
SecurityLingua  GPT-5           2026.04  17.4
PAT             GPT-5           2026.04  18.7
RPO             GPT-5           2026.04  20.7
SecurityLingua  DeepSeek-V3     2026.04  21.5
SecurityLingua  Gemini-2.5-Pro  2026.04  24.5
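
The results above are listed per (method, target LLM) pair, so comparing methods overall takes a small aggregation step. The sketch below averages ASR per method across the three target LLMs. The unweighted mean is an assumption made here for illustration, not a metric this leaderboard reports, and the helper name `mean_asr_by_method` is hypothetical. It also assumes that a lower ASR indicates a stronger defense, which matches the ascending order of the listing.

```python
from collections import defaultdict
from statistics import mean

# (method, target LLM, ASR) rows copied from the results listed above.
RESULTS = [
    ("CoopGuard", "GPT-5", 2.3),
    ("CoopGuard", "Gemini-2.5-Pro", 6.3),
    ("GoalPriority", "GPT-5", 7.2),
    ("GoalPriority", "Gemini-2.5-Pro", 7.5),
    ("Self-Reminder", "DeepSeek-V3", 7.9),
    ("Self-Reminder", "Gemini-2.5-Pro", 8.9),
    ("Self-Reminder", "GPT-5", 11.4),
    ("CoopGuard", "DeepSeek-V3", 13.8),
    ("PAT", "DeepSeek-V3", 14.8),
    ("GoalPriority", "DeepSeek-V3", 15.3),
    ("PAT", "Gemini-2.5-Pro", 16.6),
    ("RPO", "Gemini-2.5-Pro", 16.8),
    ("RPO", "DeepSeek-V3", 17.0),
    ("SecurityLingua", "GPT-5", 17.4),
    ("PAT", "GPT-5", 18.7),
    ("RPO", "GPT-5", 20.7),
    ("SecurityLingua", "DeepSeek-V3", 21.5),
    ("SecurityLingua", "Gemini-2.5-Pro", 24.5),
]


def mean_asr_by_method(rows):
    """Group ASR values by method and return the unweighted mean of each group."""
    by_method = defaultdict(list)
    for method, _target, asr in rows:
        by_method[method].append(asr)
    return {method: mean(values) for method, values in by_method.items()}


# Sort ascending by mean ASR (assumed: lower is better).
ranking = sorted(mean_asr_by_method(RESULTS).items(), key=lambda kv: kv[1])

for method, avg in ranking:
    print(f"{method:15s} {avg:6.2f}")
```

An average hides per-model variation: CoopGuard, for example, ranges from 2.3 on GPT-5 to 13.8 on DeepSeek-V3, so the per-target rows remain the primary reference.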