Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multi-label Toxic Content Classification on Jigsaw-ML (Adversarial Robustness)

71.7Attack Success Rate

AT1-unk

70.61877.921585.22592.5285Apr 9, 2024
Updated 1mo ago

Evaluation Results

MethodLinks
2024.04
71.791.0811.84
2024.04
98.7549.386.96