Conversational Assistant on Preference Optimization Conversational
[Chart: Reward over time. Best result: ZOPrO, 0.28 (Mar 5, 2025)]
Updated 4d ago
Evaluation Results
Method | Configuration | Date | Reward
ZOPrO | Model=Qwen 2.5, Parame... | 2025.03 | 0.28
ZOPrO | Model=Gemma 2, Paramet... | 2025.03 | 0.13