Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Safety Evaluation on BeaverTails & WildChat (test)

97.5Rule Adherence Score

Shadow-Level

59.90469.664579.42589.1855Jan 12, 2026
Updated 3mo ago

Evaluation Results

MethodLinks
2026.01
97.595.96-0.85.39
2026.01
83.8575.77-1.155.49
2026.01
83.6572.12-1.345.13
2026.01
72.8860.96-1.685.64
2026.01
72.8857.88-1.825.27
2026.01
65.7743.65-2.315.41
2026.01
63.6541.15-2.345.23
2026.01
61.3537.69-2.495.47