| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| DMControl | DINO | DMControl: Ball in Cup Catch Score916.9 | 29 | 3mo ago | |
| Beta Tracking | SAC | Median Samples12,000 | 24 | 3mo ago | |
| Mountain Car | UP(mlp(wide)) | Robustness Gap0.15 | 12 | 1mo ago | |
| Acrobot | UP(lstm) | Robustness Gap0.32 | 12 | 1mo ago | |
| LunarLander | UP(mlp(wide)) | Robustness Gap0.18 | 12 | 1mo ago | |
| Pendulum v0 | TIP | Median Samples21 | 9 | 3mo ago | |
| Cart Inverted Pendulum System | SOPPI | MSE (x)0.53 | 8 | 2mo ago | |
| Reacher v2 | TIP | Median Samples251 | 8 | 3mo ago | |
| Cartpole swing-up | BARL | Median Samples111 | 8 | 3mo ago | |
| Custom Pedestrian-Crossing Dataset (Evening) | LMDrive | Steer MAE0.099 | 7 | 1d ago | |
| Custom Pedestrian-Crossing Dataset Noon | LMDrive | Steering MAE0.104 | 7 | 1d ago | |
| Beta + Rotation | SAC | Median Samples Required30 | 6 | 3mo ago | |
| Quanser Aero 2 Real world | CH-PPO | Average Return-55.8 | 4 | 12d ago | |
| Quanser Aero 2 (simulation) | CH-PPO | Average Return-49.2 | 4 | 12d ago | |
| Quadrotor | MSI0.29 | 4 | 20d ago | ||
| CartPole | MSI (Aggregate State Error)0.04 | 4 | 20d ago | ||
| Pendulum | MSI0.05 | 4 | 20d ago | ||
| Surface Vehicle 10 initial states | PGDK-Online | Time (ms)0.042 | 4 | 1mo ago | |
| POPGym Noisy Stateless CartPole | Temporal Range15.274 | 4 | 3mo ago | ||
| POPGym Stateless CartPole | Temporal Range13.704 | 4 | 3mo ago | ||
| POPGym CartPole | LinOSS | Temporal Range12.362 | 4 | 3mo ago | |
| Nonlinear Gain 1 | oTIP | Median Samples to Solve41 | 4 | 3mo ago | |
| BALLBALANCE | Mean Consecutive Max Score499 | 3 | 1mo ago | ||
| Lava Path | oTIP | Median Samples to Solve41 | 3 | 3mo ago | |
| Nonlinear Gain 2 | oTIP | Median Samples to Solve51 | 3 | 3mo ago |