| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Goal Conditioned Visual Navigation | SCAND | ATE1.065 | 18 | |
| Visual Fidelity | SCAND | FID80.97 | 15 | |
| Trajectory Prediction | SCAND T=8 | L2 Error (m)0.923 | 10 | |
| Trajectory Prediction | SCAND T=5 | L2 Error (m)0.674 | 10 | |
| Action Prediction | SCAND (test) | Action MSE0.47 | 8 | |
| Goal-Conditioned Visual Navigation | SCAND (evaluation) | Success Rate (SR)68 | 8 | |
| Trajectory Prediction | SCAND T=10 | L2 Error (m)1.089 | 8 | |
| Visual Navigation | SCAND | MSE0.48 | 5 | |
| Trajectory Prediction | SCAND | ATE1.14 | 4 | |
| Navigation Planning | Scand (val) | ATE1.038 | 3 | |
| Action-Conditioned Consistency | SCAND 16s horizon | LPIPS0.495 | 3 | |
| Action-Conditioned Consistency | SCAND 8s horizon | LPIPS0.459 | 3 | |
| Action-Conditioned Consistency | SCAND 4s horizon | LPIPS0.421 | 3 | |
| Action-Conditioned Consistency | SCAND 2s horizon | LPIPS0.395 | 3 | |
| Action-Conditioned Consistency | SCAND 1s horizon | LPIPS0.368 | 3 | |
| Inference Efficiency | SCAND | Average Rollout Time (s)2.3 | 3 | |
| Video Synthesis | SCAND | FVD401.699 | 2 |