Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Accuracy and Latency on MT-Bench (Multi-turn conversation)

8.54Accuracy

Vanilla

6.47047.00777.5458.0823Dec 7, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.12
8.5410.55
2025.12
8.4911.95
8.3111.61
2025.12
8.1213.66
2025.12
7.526.21
2025.12
6.695.06
2025.12
6.5510.07