Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

HEval

Benchmarks

Task NameDataset NameSOTA ResultTrend
Code generationHEval
Pass@168.29
21
CodingHEval
Accuracy84.8
20
Instruction-followingHEval
PASS@146.12
12
CodingHEval+
Accuracy75
12
Showing 4 of 4 rows