| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| Bank Marketing | MostlyAI | F1 Score88.5 | 15 | 16d ago | |
| Boston Housing | HELIX | RMSE1.747 | 7 | 1mo ago | |
| Adult Income | HELIX | F1 Score82.07 | 7 | 1mo ago | |
| Timely-Eval | TimelyLM-8B | Leaf Classification Accuracy0.939 | 7 | 1mo ago | |
| Transparent Conductors | HELIX | RMSE0.049 | 6 | 1mo ago | |
| MMLU Machine Learning 1.0 (test) | TextGrad | Accuracy88.4 | 4 | 1mo ago | |
| NanoGPT Speedrun Competition | NanoGPT Score96.8 | 2 | 1mo ago |