| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Backdoor Attack on Reinforcement Learning | Q*bert Discrete (evaluation) | BR: 18,052 | 5 |
| Reinforcement Learning | Q*bert Atari 2600 (test) | Average Total Reward: 18,900 | 5 |
| Deep Reinforcement Learning | Q*bert Atari 2600 | IQM Return: 8,160 | 4 |