| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Generalized Referring Expression Segmentation | gRefCOCO (val) | cIoU72.2 | 165 | |
| Generalized Referring Expression Segmentation | gRefCOCO (testA) | cIoU75.2 | 159 | |
| Generalized Referring Expression Segmentation | gRefCOCO (testB) | cIoU73.1 | 141 | |
| Generalized Referring Expression Segmentation | gRefCOCO v1 (val) | cIoU76.83 | 33 | |
| Generalized Referring Expression Segmentation | gRefCOCO v1 (test B) | gIoU72.4 | 29 | |
| Generalized Referring Image Segmentation | gRefCOCO (val) | gIoU78.4 | 26 | |
| Reasoning Segmentation | gRefCOCO (testB) | gIoU0.719 | 22 | |
| Reasoning Segmentation | gRefCOCO (testA) | gIoU77.7 | 22 | |
| Referring Expression Comprehension | gRefCOCO (testA) | Precision (F1=1, IoU>=0.5)64.6 | 18 | |
| Generalized Referring Image Segmentation | gRefCOCO (testB) | gIoU62.81 | 13 | |
| Referring Image Segmentation | gRefCOCO (testA) | gIoU72.79 | 13 | |
| Referring Expression Comprehension | gRefCOCO (testB) | Pr(F1=1, IoU>=0.5)54.8 | 11 | |
| Referring Expression Comprehension | gRefCOCO (val) | Precision (F1=1, IoU>=0.5)62.1 | 11 | |
| Multi-object referring segmentation | gRefCOCO (testA) | gIoU78.5 | 9 | |
| Referring Image Segmentation | gRefCOCO (testB) | gIoU73.1 | 9 | |
| Referring Expression Segmentation | gRefCOCO (testB) | cIoU65.2 | 8 | |
| Referring Expression Segmentation | gRefCOCO (testA) | cIoU72.3 | 8 | |
| Referring Expression Segmentation | gRefCOCO (val) | cIoU68.3 | 8 | |
| Generalized Referring Expression Comprehension | gRefCOCO (testB) | Pr@F10.4461 | 7 | |
| Generalized Referring Expression Comprehension | gRefCOCO (val) | Pr@F161.9 | 7 | |
| Referring Image Segmentation | gRefCOCO (val) | mIoU65.3 | 6 | |
| Referring Expression Segmentation | gRefCOCO | N-Accuracy64.55 | 6 | |
| Referring Expression Comprehension | GRefCOCO | Prec@(F1@0.5)63.8 | 4 |