Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Open-ended Dialogue on PKU-SafeRLHF OOD
Loading...
67.8
Win Rate
CausalRM
30.152
39.926
49.7
59.474
Jan 29, 2026
Win Rate
Tie Rate
Loss Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Win Rate
Tie Rate
Loss Rate
CausalRM
Opponent=SFT
2026.01
67.8
21.5
10.7
CausalRM
Opponent=Standard RM
2026.01
57.2
29.7
13.1
CausalRM
Opponent=InfoRM
2026.01
42.7
33.1
24.2
CausalRM
Opponent=GoalRM
2026.01
31.6
47.8
20.6
Feedback
Search any
task
Search any
task