Coding on HumanEval & MBPP

HumanEval Score: 81.7

Qwen2.5-7B-Instruct

[Chart: score axis ticks 63.292, 68.071, 72.85, 77.629; date label Dec 18, 2025]
Updated 4d ago

Evaluation Results

Method  Links  Date     HumanEval  MBPP  Average
               2025.12  81.7       79.4  80.6
               2025.12  80.5       76.7  78.6
               2025.12  78.7       74.3  76.5
               2025.12  78         78    78
               2025.12  77.4       77.8  77.6
               2025.12  73.8       65.3  69.6
               2025.12  66.5       66.1  66.3
               2025.12  65.9       70.4  68.2
               2025.12  64.6       67.7  66.2
               2025.12  64         71.7  67.8