| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Simulation Task Generalization | lock-in benchmark T6 [S] | Success Rate: 13 | 2 |
| Simulation Task Generalization | lock-in benchmark T5 [S] | Success Count: 11 | 2 |
| Simulation Task Generalization | lock-in benchmark T1 | Success Count: 16 | 2 |