| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| TabArena Multiclass (6) | Average Balanced Accuracy0.8109 | 56 | 1mo ago | ||
| TabArena Lite | AutoGluon | Elo Rating1,521 | 48 | 11d ago | |
| TALENT | NCM | SGMε32.6 | 42 | 11d ago | |
| NSL-KDD | Accuracy83.13 | 36 | 17d ago | ||
| TALENT Multiclass (> 10 classes) Full (avg across datasets) | RealMLP | Rank3.75 | 31 | 11d ago | |
| cleveland | IRP | L1 calibration error0.224 | 26 | 1mo ago | |
| acoustic (test) | mCCAdL | Log Loss (Posterior Expected)0.5615 | 18 | 1mo ago | |
| letter (test) | mCCAdL | Log Loss (Posterior)0.2656 | 18 | 1mo ago | |
| iWildCam WILDS 2.0 (OOD) | FLYP | Macro F146 | 17 | 1mo ago | |
| TALENT datasets not included in TabArena (Remaining) | ModernNCA | SGM Error0.1097 | 10 | 11d ago | |
| iris | Accuracy97.33 | 10 | 1mo ago | ||
| MIC TabArena v0.1 (test) | TabM | LogLoss0.438 | 10 | 1mo ago | |
| Phishing TabArena v0.1 (test) | Agentic Tree | LogLoss0.218 | 10 | 1mo ago | |
| Maternal TabArena v0.1 (test) | TabPFNv2 | LogLoss0.413 | 10 | 1mo ago | |
| Anneal TabArena v0.1 (test) | Agentic Tree | LogLoss0.014 | 10 | 1mo ago | |
| IRIS subsampled to 100 3 classes (train) | Banzhaf | R50070 | 9 | 1mo ago | |
| DIGITS subsampled to 100 (train) | Banzhaf | R50082 | 9 | 1mo ago | |
| WINE | Banzhaf | R50077 | 9 | 1mo ago | |
| zoo | Weighted F1-score100 | 9 | 1mo ago | ||
| wine | Weighted F1100 | 9 | 1mo ago | ||
| vehicle | DARG | Weighted F1-score81 | 9 | 1mo ago | |
| thyroid | Weighted F198.1 | 9 | 1mo ago | ||
| tae | Weighted F1-score67.9 | 9 | 1mo ago | ||
| shuttle | Weighted F11 | 9 | 1mo ago | ||
| pageblocks | DARG | Weighted F1-score98.1 | 9 | 1mo ago |