Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Code Generation on HumanEval 0-shot (test)

57.3Pass@1

d2Cache

24.33232.89141.4550.009Sep 27, 2025Oct 8, 2025Oct 19, 2025Oct 30, 2025Nov 10, 2025Nov 21, 2025Dec 3, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.09
57.3--6.323.8104.4----
2025.12
51.9---------
2025.12
50---------
2025.12
50---------
2025.12
48.8---------
2025.09
46.6--11.899.748.39----
2025.12
42.1---------
2025.12
39.6---------
2025.12
38.4---------
2025.12
38.4---------
2025.12
37.8---------
2025.12
36---------
2025.12
34.7---------
2025.12
29.3---------
2025.12
28.1---------
2025.12
26.8---------
2025.12
25.6---------
2025.12
-37.8--------
2025.12
-37.2--------
2025.12
-37.2--------
2025.12
-40.1--------
2025.12
-40.3--------
2026.01
--41.513.31-----
2026.01
--39.612.91-----
2026.01
--42.77.91.7-----
2026.01
--43.33.43.9-----
2026.01
--43.42.55.3-----
2026.01
--43.939.41-----
2026.01
--45.132.91.2-----
2026.01
--45.721.41.8-----
2026.01
--44.594.4-----
2026.01
--44.55.27.6-----
2025.12
--35.97--220.812.28---
2025.12
--43.29--163.721.74---
2025.12
--32.92--132.281.37---
2025.12
--41.46--131.141.38---
2025.12
--37.19--96.841---
2025.12
--45.12--94.411---
2025.11
---11.3-7.4-25683.937.8
2025.11
---10.8-12.6-256135.939
2025.11
---4.6-17.9-100.383.136.6
2025.11
---4.2-18.9-97.780.236
2025.11
---1.9-50.9-32.395.140.2