Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Code Generation on HumanEval (Accuracy, Token Efficiency, and Runtime)

48.2Acc

Fixed-Length Denoising

15.85624.25332.6541.047Jan 30, 2026
Updated 3mo ago

Evaluation Results

MethodLinks
2026.01
48.2691.12,04833.816,046
2026.01
45.146251290.21,474
2026.01
45.1631.11,02461.64,569
2026.01
44.5494.781364.81,283
2026.01
43.9643.1769.983.5593
2026.01
43.3570.465488.4580
2026.01
36.6247.325696.6543
2026.01
26.212512897.7230
2026.01
17.159.66493.2111