Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Summarization on Preference Optimization Summarization
Loading...
0.3
Reward
ZOPrO
0.2272
0.2461
0.265
0.2839
Mar 5, 2025
Reward
Updated 4d ago
Evaluation Results
Method
Method
Links
Reward
ZOPrO
Model=Qwen 2.5, Parame...
2025.03
0.3
ZOPrO
Model=Gemma 2, Paramet...
2025.03
0.23
Feedback
Search any
task
Search any
task