Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multi-turn Chat Evaluation on MT-Bench (val)
Loading...
7.925
Score
DPO
7.899
7.90575
7.9125
7.91925
Feb 6, 2026
Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Score
DPO
Base Model=Meta-Llama-...
2026.02
7.925
SQUAREDPO
Base Model=Meta-Llama-...
2026.02
7.924
χPO
Base Model=Meta-Llama-...
2026.02
7.9
Feedback
Search any
task
Search any
task