| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| DS-1000 | DMoA | Accuracy64.34 | 19 | 16d ago | |
| MBPP (test) | Alpaca-GPT4 | Accuracy51.58 | 12 | 2mo ago | |
| H-Eval (test) | Alpaca-GPT4 + NAIT (CodeX) | Accuracy28.49 | 12 | 2mo ago | |
| LiveCodeBench (LCB) | CreditDecoding | Score14.37 | 6 | 1mo ago | |
| OpenAI HumanEval | Baseline | HumanEval Score51.22 | 6 | 1mo ago |