
Code-writing on HumanEval & MBPP EvalPlus (test)

39.02 HumanEval Pass Rate

CRITIQ

[Chart: HumanEval Pass Rate; Feb 26, 2025]

Evaluation Results

Method  Links
2025.02  39.02  33.54  68.73  48.41  53.88  40.98
2025.02  36.59  32.32  59.26  47.35  47.93  39.84
2025.02  31.71  27.44  56.61  46.3   44.16  36.87
2025.02  28.66  25.61  48.94  39.15  38.8   32.38