Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Instruction Following on MT-bench and AlpacaEval

1.55Aggregated P

NovelSelect

1.031.1651.31.435Feb 24, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.02
1.55
2025.02
1.32
2025.02
1.31
2025.02
1.25
2025.02
1.2
2025.02
1.05