Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Safety and Utility Evaluation on MaliciousGen & WildChat

97.69Rule Adherence

Step-Level

97.294897.397497.597.6026Jan 12, 2026
Updated 3mo ago

Evaluation Results

MethodLinks
2026.01
97.6995.38-0.85.24
2026.01
97.6996.35-0.775.68
2026.01
97.3195.77-0.765.33