| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| D4RL halfcheetah-medium-expert | VIPO | Normalized Score110 | 155 | 4d ago | |
| D4RL hopper-medium-expert | ATAC | Normalized Score119.2 | 153 | 4d ago | |
| D4RL walker2d-medium-expert | RRPI | Normalized Score115.7 | 124 | 4d ago | |
| D4RL Medium-Replay Hopper | NEUBAY | Normalized Score110.6 | 97 | 4d ago | |
| D4RL Medium HalfCheetah | SUMO | Normalized Score84.3 | 97 | 4d ago | |
| D4RL Medium Walker2d | NEUBAY | Normalized Score106.4 | 96 | 4d ago | |
| D4RL walker2d-random | AWAC | Normalized Score510 | 93 | 4d ago | |
| D4RL halfcheetah-random | ADMPO | Normalized Score45.4 | 86 | 4d ago | |
| D4RL Medium-Replay HalfCheetah | RAMBO | Normalized Score77.6 | 84 | 4d ago | |
| D4RL hopper-random | MOREL | Normalized Score53.6 | 78 | 4d ago | |
| D4RL Gym walker2d (medium-replay) | ROMI-CQL | Normalized Return109.7 | 68 | 4d ago | |
| D4RL Walker2d Medium v2 | PMDB | Normalized Return94.2 | 67 | 1mo ago | |
| D4RL AntMaze | KFC++ | AntMaze Umaze Return99.8 | 65 | 24d ago | |
| D4RL Medium Hopper | RRPI | Normalized Score109.4 | 64 | 4d ago | |
| D4RL walker2d-medium-replay | NEUBAY | Normalized Score99.3 | 62 | 4d ago | |
| Kitchen Partial | GCPC | Normalized Score90.2 | 62 | 1mo ago | |
| D4RL Gym halfcheetah-medium | SPQR | Normalized Return74.8 | 60 | 4d ago | |
| D4RL halfcheetah v2 (medium-replay) | CQL | Normalized Score76.9 | 58 | 1mo ago | |
| D4RL Gym walker2d medium | RORL | Normalized Return102.4 | 58 | 4d ago | |
| hopper medium | QDFM | Normalized Score3,729 | 58 | 1mo ago | |
| D4RL halfcheetah-expert v2 | EDAC | Normalized Score106.8 | 56 | 1mo ago | |
| D4RL walker2d-expert v2 | PMDB | Normalized Score115.9 | 56 | 1mo ago | |
| D4RL hopper-expert v2 | CEIL | Normalized Score113 | 56 | 1mo ago | |
| OGBench antmaze-large-navigate-singletask task1-v0 to task5-v0 | GFP | Score95.6 | 55 | 1mo ago | |
| D4RL Hopper-medium-replay v2 | EDAC | Normalized Return107.4 | 54 | 1mo ago |