| Dataset Name | SOTA Method | Metric | Score | Trend | Updated |
|---|---|---|---|---|---|
| MiniHanabi Co-op | GPT-5-mini | Average Normalized Game Score | 85.99 | 9 | 27d ago |
| KuhnPoker vs. NE Bot | Strat-Reasoner-4B | Normalized Score (First Move) | 94.04 | 9 | 27d ago |
| Tic-Tac-Toe vs. MCTS Bot, 1000 sims | Gemini-2.5-flash | First-move Normalized Score | 88.32 | 9 | 27d ago |
| Tic-Tac-Toe vs. MCTS Bot, 100 sims | Strat-Reasoner-4B | First-move Normalized Score | 90.77 | 9 | 27d ago |