Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
LLM Red-teaming on Rainbow Teaming defended Target Model
Loading...
0
UA
SFT
-4.4
25.3
55
84.7
May 1, 2026
UA
ASR
Updated 1mo ago
Evaluation Results
Method
Method
Links
UA
ASR
SFT
2026.05
0
0
DPO
2026.05
0
0
PPO
2026.05
0.33
0.03
PPO + Curiosity
2026.05
0.33
0.03
Jailbreak R1
CoT=enabled
2026.05
5.67
4.82
ICL
2026.05
9.67
0.94
GFN
2026.05
14
64.13
Rainbow Teaming
2026.05
18
1.76
S-GFN
2026.05
110
83.24
Feedback
Search any
task
Search any
task