Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Direct Preference Optimization on Skywork AlpacaEval 2.0
Loading...
20.56
LCWR
Difficulty-Based Preference Data Selection
1.8504
6.7077
11.565
16.4223
Aug 6, 2025
LCWR
Win Rate
Updated 16d ago
Evaluation Results
Method
Method
Links
LCWR
Win Rate
Difficulty-Based Preference Data Selection
Selection Method=Ours
2025.08
20.56
19.38
Random
Selection Method=Random
2025.08
19.6
18.57
ZIP†
Selection Method=ZIP
2025.08
18.74
18.96
Full Set
Selection Method=Full Set
2025.08
18.13
17.54
DiverseEvol†
Selection Method=Diver...
2025.08
17.75
18.33
SDPO
Selection Method=SDPO
2025.08
17.46
18.56
Tulu3-SFT
Selection Method=SFT B...
2025.08
2.57
2.16
Feedback
Search any
task
Search any
task