Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Open-ended Dialogue on Anthropic-Helpful (ID)
Loading...
0.762
Win Rate
CausalRM
0.48952
0.56026
0.631
0.70174
Jan 29, 2026
Win Rate
Tie Rate
Lose Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Win Rate
Tie Rate
Lose Rate
CausalRM
Opponent=SFT
2026.01
0.762
0.194
0.044
CausalRM
Opponent=Standard RM
2026.01
0.537
0.353
0.11
CausalRM
Opponent=InfoRM
2026.01
0.524
0.38
0.096
CausalRM
Opponent=GoalRM
2026.01
0.5
0.39
0.11
Feedback
Search any
task
Search any
task