| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Adversarial Attack | Seaquest | Cumulative Reward2,856.96 | 80 | |
| Reinforcement Learning | Seaquest Atari (classical) | Reward4,759 | 10 | |
| Atari Game Playing | Seaquest | Game Score356,584 | 6 | |
| Reinforcement Learning | Seaquest Atari 2600 (test) | Avg Total Reward28,010 | 5 | |
| Deep Reinforcement Learning | Seaquest Atari 2600 | IQM Return1,642 | 4 | |
| Reinforcement Learning | Seaquest | Average Reward5,000 | 4 | |
| Policy Imitation | Seaquest Atari (test) | Accuracy98 | 4 |