Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Trustworthiness Evaluation on Trustworthiness Average (human evaluation)
Loading...
0.88
Control Win Rate
Sparse Activation Control
0.84672
0.85536
0.864
0.87264
Nov 4, 2024
Control Win Rate
Tie Rate
Non-Control Win Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Control Win Rate
Tie Rate
Non-Control Win Rate
Sparse Activation Control
2024.11
0.88
0.12
0
Sparse Activation Control
2024.11
0.848
0.092
0.06
Feedback
Search any
task
Search any
task