Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LLM Jailbreaking on Llama3 RB

71Success Rate First (SRF)

Adaptive Probe-based Steering

-1.817.13654.9May 19, 2026
Updated 13d ago

Evaluation Results

MethodLinks
718698
2026.05
647088
2026.05
264144
2026.05
638
2026.05
417
2026.05
141