Human Evaluation on HH dataset
[Chart: Win Rate, Tie Rate, and Loss Rate by date. Highest Win Rate: 59, RRHF_DP, Apr 11, 2023.]
Evaluation Results
| Method | Settings | Date | Win Rate (%) | Tie Rate (%) | Loss Rate (%) |
| RRHF_DP | rho=Alpaca, compared_a... | 2023.04 | 59 | 30 | 11 |
| RRHF_DP | rho=Alpaca, compared_a... | 2023.04 | 27 | 48 | 25 |
| RRHF_DP | rho=Alpaca, compared_a... | 2023.04 | 0 | 90 | 10 |