| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| Sawyer Pusher (test) | Average Return-23.36 | 6 | 3mo ago | ||
| Sweeper (test) | Average Return-50.86 | 6 | 3mo ago | ||
| Ant (test) | Average Return968.8 | 6 | 3mo ago | ||
| Point Maze (test) | Average Return-5.21 | 6 | 3mo ago | ||
| MESSENGER (S1) | LED-WM | Win Rate100,000 | 4 | 3mo ago | |
| MESSENGER-WM NewAll (test) | LED-WM | Average Sum of Scores1.16 | 4 | 3mo ago | |
| MESSENGER-WM NewAttr (test) | LED-WM | Average Score115 | 4 | 3mo ago | |
| MESSENGER-WM NewCombo (test) | LED-WM | Avg Sum Score1.31 | 4 | 3mo ago | |
| MESSENGER (S3) | CRL | Win Rate32,190 | 3 | 3mo ago | |
| MESSENGER (S2) | EMMA (w/o curriculum) | Win Rate4,512 | 3 | 3mo ago | |
| Disabled-Ant meta (test) | Meta-IL | Average Return-27.86 | 3 | 3mo ago | |
| Point-Maze-Shift (meta-test) | Meta-IL | Average Return-28.61 | 3 | 3mo ago | |
| MESSENGER S2 (dev) | - | - | 0 | 3mo ago |