| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Classification | tic-tac-toe | F1 Score100 | 26 | |
| Classification | tic-tac-toe | Accuracy100 | 21 | |
| Classification | tic-tac-toe | ROC-AUC100 | 15 | |
| Classification | tic-tac-toe | F1 Macro100 | 12 | |
| Classification | UCI tic-tac-toe (80%:20%) | Classification Error1.67 | 9 | |
| Active Learning | Tic-Tac-Toe | AULC18.9 | 8 | |
| Tabular Classification | Tic-Tac-Toe | Cohen's Kappa0.233 | 8 | |
| Robot Manipulation | Tic-Tac-Toe SO-101 | Success Rate56.7 | 8 | |
| Multi-Agent Strategic Reasoning | Tic-Tac-Toe (In-domain) | Success Rate67.2 | 8 | |
| Classification | tic-tac-toe | Latency (s)0.002 | 6 | |
| Classification | tic-tac-toe | Rule Count69.9 | 4 | |
| Conceptual Clustering | Tic-tac-toe | Number of Patterns811 | 2 | |
| SHAP value approximation | tic-tac-toe (test) | R^20.958 | 2 | |
| Strategic game playing | Tic-Tac-Toe (train) | Win Rate54.05 | 2 |