Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Code Generation on HumanEval (Acc, Step)

59.76Accuracy

KLASS

24.233633.456842.6851.9032Nov 7, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.11
59.7673.73
2025.11
58.53256
2025.11
58.53256
2025.11
43.29128
2025.11
43.29128
2025.11
40.8591.98
2025.11
39.63256
2025.11
35.97256
2025.11
30.48128
2025.11
25.6128