| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Reinforcement Learning | POPGym Noisy Stateless Pendulum Hard | MMER | 0.657 | 9 |
| Reinforcement Learning | POPGym Stateless Pendulum (Hard) | MMER | 82.8 | 9 |
| Reinforcement Learning | POPGym Noisy Stateless CartPole (Hard) | MMER | 20.7 | 9 |
| Reinforcement Learning | POPGym Stateless CartPole Hard | MMER | 0.127 | 9 |
| Reinforcement Learning | POPGym Aggregated (48 tasks) | Aggregated Return (All) | 10.4 | 6 |
| Offline Reinforcement Learning | POPGym | Average Normalized Score (All) | 9.5 | 5 |
| Memory | POPGym Copy k=10 | Temporal Range | 16.715 | 4 |
| Memory | POPGym Copy k=5 | Temporal Range | 17.255 | 4 |
| Memory | POPGym Copy k=3 | Temporal Range | 17.312 | 4 |
| Memory | POPGym Copy k=1 | Temporal Range | 12.294 | 4 |
| Memory | POPGym RepeatFirst | Temporal Range | 21.177 | 4 |
| Control | POPGym Noisy Stateless CartPole | Temporal Range | 15.274 | 4 |
| Control | POPGym Stateless CartPole | Temporal Range | 13.704 | 4 |
| Control | POPGym CartPole | Temporal Range | 12.362 | 4 |