| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Adversarial Reinforcement Learning | Connect Four 100% optimal adversary (test-time) | Avg Return-0.98 | 3 | |
| Adversarial Reinforcement Learning | Connect Four 70% optimal adversary (test) | Average Return0.02 | 3 | |
| Adversarial Reinforcement Learning | Connect Four 50% optimal adversary (test-time) | Average Return0.11 | 3 | |
| Adversarial Reinforcement Learning | Connect Four 30% optimal adversary (test-time) | Average Return0.55 | 3 | |
| Strategic game playing | Connect Four held-out (test) | Win Rate21.55 | 2 | |
| Contingent Planning | Connect Four | Metric- | 0 |