| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Policy Optimization | Office World MAP0 | Avg Training Steps: 4,150 | 18 |
| Policy Optimization | Office World Map 3, Exp 5 | Average Training Steps: 5,806 | 7 |
| Policy Optimization | Office World Map 2 Exp 5 | Average Training Steps: 3,767 | 7 |
| Policy Optimization | Office World Map 4 Exp 6 | Average Training Steps: 5,630 | 7 |
| Policy Optimization | Office World Map 1, Exp 5 | Average Training Steps: 3,125 | 7 |
| Policy Optimization | Office World MAP4 | Average Training Steps: 5,630 | 7 |
| Policy Optimization | Office World MAP1 | Avg Training Steps: 3,125 | 7 |