Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LLM Jailbreaking on Llama3 TAR

32Success Rate First (SRF)

Adaptive Probe-based Steering

-1.287.361624.64May 19, 2026
Updated 13d ago

Evaluation Results

MethodLinks
322450
2026.05
252240
2026.05
191427
2026.05
151627
2026.05
111
2026.05
000