Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Refusal Evaluation on WildGuard Harmful

84.35Refusal Rate

Low-Rank Combination

58.006864.845971.68578.5241Mar 9, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
84.35
2026.03
77.72
2026.03
77.19
2026.03
74.27
2026.03
73.87
2026.03
73.74
2026.03
59.02