| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Unsupervised Video Object Segmentation | MOVi-E 24 frames (val) | mIoU47.5 | 8 | |
| Unsupervised Image Segmentation | MOVi-E individual frames | FG-ARI65.1 | 7 | |
| Multi-object editing | MOVi-E | PSNR22.03 | 6 | |
| Property Prediction | MOVi-E | Position MSE0.01 | 5 | |
| Object-centric learning | MOVI-E (test) | FG-ARI70.6 | 3 | |
| Object Discovery | MOVI-E (val) | FG-ARI64.7 | 3 | |
| Downstream Property Prediction | MOVi-E 1.0 (test) | Position MSE1.85 | 3 | |
| Unsupervised Object Segmentation | MOVi-E 1.0 (test) | mBO38.96 | 3 | |
| Compositional Image Generation | MOVi-E (test) | FID64.76 | 3 |