| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Adversarial Attack | Seaquest | Cumulative Reward2,856.96 | 80 | |
| Reinforcement Learning | Seaquest Atari 2600 (test) | Avg Total Reward28,010 | 5 | |
| Reinforcement Learning | Seaquest | Average Reward5,000 | 4 | |
| Policy Imitation | Seaquest Atari (test) | Accuracy98 | 4 |