Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Coding Reasoning on HumanEval

96.34Accuracy (%)

COPT

92.533693.521894.5195.4982May 19, 2026
Updated 14d ago

Evaluation Results

MethodLinks
2026.05
96.341,842
2026.05
94.511,023
2026.05
93.92,627
2026.05
92.682,368