Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Safety Detection on ToxicChat (held-out)

87.7AUROC

MultiLayer-DIM

74.59677.99881.484.802May 18, 2026
Updated 13d ago

Evaluation Results

MethodLinks
2026.05
87.7
2026.05
81.8
2026.05
79.9
2026.05
79.8
2026.05
75.1