| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Generalized Referring Expression Segmentation | gRefCOCO (testA) | cIoU 75.2 | 115 |
| Generalized Referring Expression Segmentation | gRefCOCO (val) | cIoU 72 | 98 |
| Generalized Referring Expression Segmentation | gRefCOCO (testB) | cIoU 73.1 | 97 |
| Generalized Referring Expression Segmentation | gRefCOCO v1 (val) | cIoU 76.83 | 33 |
| Generalized Referring Expression Segmentation | gRefCOCO v1 (test B) | gIoU 72.4 | 29 |
| Referring Expression Comprehension | gRefCOCO (testA) | Precision (F1=1, IoU>=0.5) 64.6 | 18 |
| Referring Expression Comprehension | gRefCOCO (testB) | Pr(F1=1, IoU>=0.5) 54.8 | 11 |
| Referring Expression Comprehension | gRefCOCO (val) | Precision (F1=1, IoU>=0.5) 62.1 | 11 |
| Generalized Referring Expression Comprehension | gRefCOCO (testB) | Pr@F1 0.4461 | 7 |
| Generalized Referring Expression Comprehension | gRefCOCO (val) | Pr@F1 61.9 | 7 |
| Referring Image Segmentation | gRefCOCO (val) | mIoU 65.3 | 6 |
| Referring Expression Segmentation | gRefCOCO | N-Accuracy 64.55 | 6 |
| Referring Image Segmentation | gRefCOCO (testB) | mIoU 62.18 | 5 |
| Referring Image Segmentation | gRefCOCO (testA) | mIoU 70.98 | 5 |