Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Summarization on Reddit TL;DR (test)
Loading...
75.61
Preference vs SFT (%)
Cal-DPO
65.8132
68.3566
70.9
73.4434
Dec 19, 2024
Preference vs SFT (%)
Preference vs Chosen (%)
Average Preference Score (%)
Win Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Preference vs SFT (%)
Preference vs Chosen (%)
Average Preference Score (%)
Win Rate
Cal-DPO
2024.12
75.61
59.37
67.49
-
CPO
2024.12
73.13
58.89
66.01
-
DPOP
2024.12
72.95
58.82
65.89
-
IPO
2024.12
72.17
56.51
64.34
-
DPO
2024.12
71.22
57.58
64.4
-
DPO+NLL
2024.12
69.37
55.26
62.31
-
SLiC
2024.12
68.61
55.72
62.17
-
f-DPO
2024.12
66.19
51.37
58.78
-
SCAR
Comparison Baseline=RL...
2025.12
-
-
-
61.2
SCAR
Comparison Baseline=AB...
2025.12
-
-
-
60.3
Feedback
Search any
task
Search any
task