General LLM Evaluation on MT-Bench zh

Overall Score: 6.66

Qwen2.5-14B

[Chart: y-axis ticks 4.8712, 5.3356, 5.8, 6.2644; x-axis label Jun 30, 2025]
Updated 1mo ago

Evaluation Results

Date       Score
2025.06    6.66
2025.06    6.36
2025.06    6.03
2025.06    5.78
2025.06    5.59
2025.06    5.03
2025.06    4.94