| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Long-horizon robot manipulation | Calvin ABCD→D | Task 1 Completion Rate99.4 | 127 | |
| Robotic Manipulation | CALVIN ABCD->D | Avg Length0.4 | 89 | |
| Long-horizon task completion | Calvin ABC->D | Success Rate (1)96.8 | 67 | |
| Robot Manipulation | CALVIN (ABC->D) | Average Successful Length4.75 | 48 | |
| Sequential Robotic Manipulation | CALVIN | Success Rate (1 task)99.8 | 45 | |
| Robotic Manipulation | CALVIN D→D | Average Length4.52 | 40 | |
| Long-horizon robotic manipulation | CALVIN ABC-D | Task 1 Success Rate98.4 | 34 | |
| Instruction-following robotic manipulation | CALVIN ABC→D (unseen environment D) | Success Rate (Length 1)98.5 | 29 | |
| Robotic Manipulation | Calvin ABC-D | Task-1 Score100 | 26 | |
| Robot Manipulation | CALVIN ABC->D 1.0 | Success Rate (1 Inst)96.8 | 18 | |
| Long-horizon language-conditioned policy learning | CALVIN | Success Rate (Step 5/5)98.4 | 16 | |
| Long-horizon robotic manipulation | CALVIN ABC→D Zero-shot | Task 1 Success Rate98.8 | 16 | |
| Long-horizon robot manipulation | CALVIN | Task Completion Rate (1)96.3 | 15 | |
| Long-horizon task completion | CALVIN | Success Rate (1 Task)93.8 | 15 | |
| Long-Horizon Multi-Task Language Control | CALVIN ABC→D (test) | Seq Success (1)96 | 13 | |
| Language-Conditioned Manipulation | CALVIN MTLC | Success Rate95 | 12 | |
| Long-horizon task success | CALVIN D→D long-horizon | Success Rate (LH-1)99.5 | 11 | |
| Robot manipulation | CALVIN 10% ABCD → D | Success Rate (L=1)84.1 | 11 | |
| Language-conditioned manipulation | CALVIN LH-MTLC | Success Rate (1 Instruction)97.5 | 10 | |
| Failure Detection | DSMF-CALVIN (test) | Accuracy90.64 | 10 | |
| Language-conditioned long-horizon robotic manipulation | CALVIN ABC→D | Success Rate (1 Task)99.6 | 8 | |
| Language-conditioned visuomotor control | CALVIN ABC→D (Zero-shot) | Completion Rate (Seq 1)96 | 8 | |
| Robot Manipulation | Calvin ABC -> D | Average Path Length0.45 | 7 | |
| Robot Manipulation | Calvin D -> D | Average Length2.92 | 7 | |
| Track prediction | CALVIN ABC → D (test) | Success Rate (δ < 4)43.7 | 7 |