General Language Understanding on MMLU, AlpacaEval, Arena-Hard

73.41 MMLU Accuracy

Qwen2.5-7B + DataFlow-Chat-15K

[Chart: MMLU Accuracy, Dec 18, 2025]
Updated 4d ago

Evaluation Results

Date     MMLU   AlpacaEval  Arena-Hard
2025.12  73.41  10.11       110         28.21
2025.12  73.09  3.7         130         26.03
2025.12  72.97  3.97        80          25.91
2025.12  71.45  7.05        60          26.36