| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Constrained Multi-objective Optimization | MW | Runtime (s)20.04 | 9 | |
| In-Context Reinforcement Learning | MW 70-1 DR9 | NAUC33 | 4 | |
| In-Context Reinforcement Learning | MW 40-1 DR9 | NAUC35 | 4 | |
| In-Context Reinforcement Learning | MW 20-1 DR9 | NAUC32 | 4 | |
| Failure / predicate detection | MW Pick-Place | F1 Score99.2 | 4 |