Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LLM Red-teaming on S-GFN-defended Target Model

7.33Unsuccessful Attack Rate (UA)

S-GFN

-0.29321.68593.6655.6441May 1, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.05
7.3375
2026.05
5.6755
2.3323
2026.05
110
2026.05
0.333
2026.05
00
2026.05
00
2026.05
00
00