| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Reinforcement Learning | InvPendSwingup | Episodes Completed2,000,000 | 3 | |
| Reinforcement Learning | InvPendSwingup standard (test) | Episode Length13 | 2 | |
| Interpretability Evaluation | InvPendSwingup | Interpretability Score4.3 | 2 |