HumanEval+

Benchmarks

Task Name	Dataset Name	SOTA Result
Code Generation	HumanEval+ (test)	Pass@198.1	132
Code Generation	HumanEval+ v1 (test)	Pass Rate87.8	55
Code Generation	HumanEval+ Out-of-Domain (test)	Accuracy81.71	18
Code Reasoning	HumanEval+	Pass@1697	15
Code Generation	HumanEval+	Score34.76	11
Unit test generation	HumanEval+ (test)	Error Rate1.27	7
Code Generation	HumanEval+ ko	Score92.1	3

Showing 7 of 7 rows