
Code on HumanEval (Accuracy)

93.4 HumanEval Accuracy

GPT-5

[Chart: HumanEval accuracy over time, May 26, 2025 to Feb 24, 2026]
Updated 4d ago

Evaluation Results

Date      Accuracy   Links
2026.01   93.4       -
2026.01   92.1       -
2026.01   91.5       -
2026.01   91.5       -
2026.01   85.4       -
2026.01   85.4       -
2026.01   81.5       -
2026.01   80.5       -
2026.01   79.2       -
2026.01   79.19      -
2026.01   78.7       -
2026.01   77.85      -
2026.01   77.18      -
2026.01   76.2       -
2026.01   75.4       -
2026.01   75         -
2026.01   73         -
2026.01   72.8       -
2026.01   72.6       -
2026.01   72.3       -
2026.01   72.2       -
2026.01   72.1       -
2026.01   70.9       -
2026.01   70.8       -
2026.01   70.7       -
2026.01   69.9       -
2026.01   68.9       -
2026.01   66.8       -
2026.01   65.2       -
2026.01   59.06      -
2026.01   57.72      -
2026.01   54.9       -
2026.02   54.26      2,012
2026.02   53.83      2,282
2026.01   53         -
2025.05   53         -
2025.05   51.83      -
2025.05   50.61      -
2025.05   50         -
2025.05   46.95      -
2026.02   46.55      1,802
2025.05   45.73      -
2026.01   45.64      -
2026.02   43.42      2,042
2025.05   41.46      -
2025.05   41.46      -
2026.01   37.8       -
2025.05   32.32      -
2026.02   21.8       2,132
2026.02   18.35      2,461