
Harmful Content Detection on Standard Harmful Content Datasets Evasion Attack

96 Phishing

GAVEL

Jan 27, 2026
Updated 1mo ago

Evaluation Results

Method    Links
2026.01   96  95  97  100  100  98  71  97  100
2026.01   91  95  56  66  85  87  91  92  97
2026.01   86  91  29  80  54  94  100  91  84