Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Accuracy and Latency on MT-Bench (Multi-turn conversation)

8.54Accuracy

Vanilla

6.47047.00777.5458.0823Dec 7, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.12
8.5410.55
2025.12
8.4911.95
8.3111.61
2025.12
8.1213.66
2025.12
7.526.21
2025.12
6.695.06
2025.12
6.5510.07