| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Robotic Image Segmentation | OCID | mIoU91.24 | 27 | |
| Language-guided grasp detection and segmentation | OCID VLG (Multi-Split) | J@10.854 | 11 | |
| Instance Segmentation | OCID RGB only (test) | AP5078.2 | 9 | |
| Grasping | OCID-VLG (overall) | J@1 Success Rate87.32 | 6 | |
| Localization | OCID-Ref (val) | Accuracy48.8 | 6 | |
| Language-guided grasping | OCID-VLG (Novel-Classes Split) | J@146 | 2 |