| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Open-Vocabulary Semantic Segmentation | ADE-847 | mIoU 18.1 | 59 |
| Semantic Segmentation | ADE-847 | mIoU 17.6 | 43 |
| Semantic Segmentation | ADE | mIoU 46.4 | 32 |
| Joint Entity and Relation Extraction | ADE | Entity F1 Score 0.912 | 26 |
| Relation Extraction | ADE | Relation Strict F1 83.9 | 20 |
| Named Entity Recognition | ADE (test) | F1 Score 96.31 | 19 |
| Relation Extraction | ADE (test) | Macro F1 90 | 13 |
| Open-Vocabulary Semantic Segmentation | ADE-150 | mIoU 33.5 | 11 |
| Segmentation | ADE (out-domain) | PQ 26.8 | 10 |
| Interactive Segmentation | ADE | IoU (Point) 66.4 | 9 |
| Relation Extraction | ADE corpus | F1 Score 84.45 | 8 |
| Named Entity Recognition | ADE exact match (10-fold cross val) | Macro-F1 91.3 | 7 |
| Classification | ADE | F1 Score 91.03 | 7 |
| Semantic Image Synthesis | ADE-outd. (val) | FID 48.6 | 7 |
| Relation Extraction | ADE 10-fold cross-validation | Macro F1 83.2 | 7 |
| Open-vocabulary Segmentation | ADE-OV | mIoU 23.7 | 6 |
| Named Entity Recognition | ADE | NER Accuracy 91.5 | 6 |
| Entity Recognition | ADE 10-fold cross-validation | F1 Score 0.8711 | 6 |
| Object Detection | ADE-150 (val) | AP (Box) 29.6 | 5 |
| Unconditional Image Synthesis | ADE (Indoor) | FID 6.7 | 5 |
| Relation Extraction | ADE 10-fold cross-validation v1 (test) | Precision 80.51 | 3 |
| Named Entity Recognition | ADE 10-fold cross-validation v1 (test) | Precision 0.8926 | 3 |
| Semantic Image Synthesis | ADE-Indoor (val) | User Preference Score 2.49 | 3 |
| Controllable Image Generation | ADE20k | Sudden Convergence Steps 8.3 | 2 |
| Object Removal | ADE20k | SSIM 0.491 | 2 |