| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| D4RL walker2d-medium-expert | O2PPO | Normalized Score121.4 | 63 | 1mo ago | |
| D4RL halfcheetah-medium-replay | Off2On | Normalized Score0.8874 | 61 | 1mo ago | |
| D4RL walker2d-medium | IDQL | Normalized Score88.1 | 60 | 1mo ago | |
| D4RL halfcheetah-medium | PRDC | Normalized Score63.5 | 60 | 1mo ago | |
| D4RL halfcheetah-medium-expert | MBOP | Normalized Score105.9 | 53 | 1mo ago | |
| D4RL Ant medium-offline | LoDADA | Normalized Score85.28 | 36 | 1mo ago | |
| D4RL Hopper medium-offline | LoDADA | Score40.77 | 36 | 1mo ago | |
| D4RL Walker2d medium-offline | LoDADA | Normalized Score37.45 | 36 | 1mo ago | |
| D4RL HalfCheetah medium-offline | LoDADA | Normalized Score34.97 | 36 | 1mo ago | |
| Dog & Humanoid suite | WIMLE | IQM0.897 | 32 | 1mo ago | |
| D4RL Hopper-medium | O2PPO | Normalized Score100.42 | 30 | 1mo ago | |
| Hopper IID (test) | RMA | Mean Episode Reward1,859 | 24 | 1mo ago | |
| Walker IID (test) | Mean Episode Reward1,909 | 24 | 1mo ago | ||
| Half Cheetah IID (test) | Mean Episode Reward2,278 | 24 | 1mo ago | ||
| Ant IID (test) | DMAP | Mean Episode Reward2,240 | 24 | 1mo ago | |
| Hopper | DAgger | Convergence (%)100 | 20 | 1mo ago | |
| NeoRL Walker2d Medium | VIPO | Mean Normalized Score76.8 | 19 | 1mo ago | |
| NeoRL HalfCheetah-Medium | NEUBAY | Mean Normalized Score81.1 | 19 | 1mo ago | |
| NeoRL Walker2d Low | VIPO | Mean Normalized Score67.6 | 19 | 1mo ago | |
| NeoRL Hopper-Low | VIPO | Mean Normalized Score30.7 | 19 | 1mo ago | |
| NeoRL Walker2d High | ADMPO | Mean Normalized Score82.2 | 18 | 1mo ago | |
| NeoRL HalfCheetah High | VIPO | Mean Normalized Score89.4 | 18 | 1mo ago | |
| D4RL Hopper-medium-expert | TD3+BC | Normalized Score (100k Steps)112.2 | 18 | 1mo ago | |
| Walker2d Medium-Expert v2 | Tgt+Src (Edited) | Average Normalized Score82.9 | 12 | 1mo ago | |
| Walker2d Medium-Replay v2 | Tgt+Src (Edited) | Average Normalized Score25.9 | 12 | 1mo ago |