HumanEval and MBPP

Benchmarks

Task Name	Dataset Name	SOTA Result
Code Generation	HumanEval and MBPP	Overall Average Score85.6	78
Code Generation	HumanEval and MBPP EvalPlus	HumanEval+ Pass@k70.1	29
Code Generation	HumanEval+ and MBPP+	HumanEval+ Score75	6
Code-writing	HumanEval & MBPP EvalPlus (test)	HumanEval Pass Rate39.02	4

Showing 4 of 4 rows