Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Safety and Utility Evaluation on BeaverTails & LMSYS-Chat

97.88Rule Score

Shadow-Level

67.886475.673283.4691.2468Jan 12, 2026
Updated 3mo ago

Evaluation Results

MethodLinks
2026.01
97.8896.92-0.875.17
2026.01
70.3858.46-2.425.04
2026.01
69.0456.73-2.525.14