Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Chat Evaluation on MT-Bench 1.0 (test)

8MT-Bench Score

Llama-3.1-Instruct

5.54566.18286.827.4572Aug 27, 2024
Updated 4d ago

Evaluation Results

MethodLinks
2024.08
8--
2024.08
8--
2024.08
7.94--
2024.08
7.787.3
2024.08
7.68.17
2024.08
7.357.826.88
2024.08
7.34--
2024.08
7.327.936.7
2024.08
7.31--
2024.08
7.03--
2024.08
6.97.66.1
2024.08
6.867.566.15
2024.08
6.747.246.24
2024.08
6.57.15.8
2024.08
6.486.836.13
2024.08
6.466.916.01
2024.08
6.4--
2024.08
6.47.255.55
2024.08
5.646.165.11