Safety and Utility Evaluation on BeaverTails & LMSYS-Chat
[Chart: Rule Score by method. Highlighted entry: Shadow-Level, Rule Score 97.88, Jan 12, 2026. Metrics: Rule Score, MD-Judge Score, RM Score, MT-1 Score. Updated 3mo ago.]
Evaluation Results
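| Method       | Details                   | Date    | Rule Score | MD-Judge Score | RM Score | MT-1 Score |
|--------------|---------------------------|---------|------------|----------------|----------|------------|
| Shadow-Level | Backbone=Qwen2.5-7B, M... | 2026.01 | 97.88      | 96.92          | -0.87    | 5.17       |
| Step-Level   | Backbone=Qwen2.5-7B, M... | 2026.01 | 70.38      | 58.46          | -2.42    | 5.04       |
| Client-Level | Backbone=Qwen2.5-7B, M... | 2026.01 | 69.04      | 56.73          | -2.52    | 5.14       |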