Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Code Generation on HumanEval and MBPP EvalPlus

70.1HumanEval+ Pass@k

Mistral-7B-Instruct-v0.2

13.6828.327542.97557.6225May 6, 2024Jun 25, 2024Aug 14, 2024Oct 3, 2024Nov 22, 2024Jan 11, 2025Mar 3, 2025
Updated 3d ago

Evaluation Results

MethodLinks
2024.05
70.17544.73759.953.6
2024.05
65.972.160.150.466.158.2
2025.03
62.866.563.451.461-
2025.03
62.865.962.951.960.9-
2025.03
6164.663.951.460.2-
2024.05
57.963.460.448.661.953.3
2024.05
56.761.670.159.365.958
2024.05
53.757.968.756.963.355.3
2025.02
51.83--48.68--
2024.05
39.645.159.549.752.344.7
2025.02
39.02--39.42--
2024.05
35.442.757.14549.940.2
2025.02
34.76--50.26--
2024.05
29.333.561.451.647.540.5
2025.02
24.39--27.51--
2024.05
23.828.751.942.140.333
2025.02
23.78--21.69--
2025.02
21.34--21.34--
2025.02
20.73--22.75--
2025.02
20.4--17.7--
2025.02
20.12--22.22--
2024.05
20.126.852.643.439.731.8
2025.02
18.9--23--
2025.02
18.9--21.42--
2025.02
18.9--23.81--
2025.02
17.1--22.2--
2025.02
17.07--23.38--
2025.02
17.07--22.49--
2025.02
15.85--22.22--