Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
LLM Jailbreaking on GPTFuzzer Scenario G2
Loading...
77
Hypervolume
EvoJail
-3.08
17.71
38.5
59.29
Mar 20, 2026
Hypervolume
Updated 27d ago
Evaluation Results
Method
Method
Links
Hypervolume
EvoJail
Target Model=Llama-3.1...
2026.03
77
EvoJail
Target Model=gpt-4.1-Nano
2026.03
75.5
CodeAttack
Target Model=gpt-4.1-Nano
2026.03
71.3
EvoJail
Target Model=Llama-2-7...
2026.03
67
CodeAttack
Target Model=Llama-3.1...
2026.03
59.5
ReNeLLM
Target Model=Llama-3.1...
2026.03
52.5
CodeChameleon
Target Model=gpt-4.1-Nano
2026.03
52
CodeChameleon
Target Model=Llama-3.1...
2026.03
50
Jailbroken
Target Model=Llama-3.1...
2026.03
50
FlipAttack
Target Model=Llama-3.1...
2026.03
37.2
CodeChameleon
Target Model=Llama-2-7...
2026.03
35.5
Jailbroken
Target Model=Llama-2-7...
2026.03
23.6
ReNeLLM
Target Model=Llama-2-7...
2026.03
16.3
FlipAttack
Target Model=gpt-4.1-Nano
2026.03
12.8
ReNeLLM
Target Model=gpt-4.1-Nano
2026.03
11
FlipAttack
Target Model=Llama-2-7...
2026.03
4.7
CodeAttack
Target Model=Llama-2-7...
2026.03
4.2
Cipher
Target Model=Llama-3.1...
2026.03
2
Jailbroken
Target Model=gpt-4.1-Nano
2026.03
1.4
Cipher
Target Model=Llama-2-7...
2026.03
0
Cipher
Target Model=gpt-4.1-Nano
2026.03
0
Feedback
Search any
task
Search any
task