| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Video Generation | Taste-Rob Hand Object Masked Region (test) | L1 Loss 20.9 | 6 |
| Video Generation | Taste-Rob Full Frame (test) | L1 7.77 | 6 |
| Goal-image generation | Taste-Rob | LPIPS 0.09 | 5 |
| Planning Video Generation | Taste-Rob (random 200 examples) | FVD 8.21 | 3 |