SOTA Coding Ability benchmarks and papers with code

Benchmarks

Dataset Name	SOTA Method	Metric
DS-1000	DMoA	Accuracy64.34	19	2mo ago
MBPP (test)	Alpaca-GPT4	Accuracy51.58	12	4mo ago
H-Eval (test)	Alpaca-GPT4 + NAIT (CodeX)	Accuracy28.49	12	4mo ago
LiveCodeBench (LCB)	CreditDecoding	Score14.37	6	3mo ago
OpenAI HumanEval	Baseline	HumanEval Score51.22	6	3mo ago

Showing 5 of 5 rows