| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Scene Graph Generation | Open Images V6 (test) | wmAPrel 56.38 | 74 |
| Object Detection | Open Images V7 | Latency (us) 95.1 | 30 |
| Multi-label classification | Open Images | mAP 84.5 | 24 |
| Multi-label Classification | Open Images (test) | mAP 85 | 16 |
| Feature Attribution Evaluation | Open Images 5000 random images (val) | AUC 0.719 | 13 |
| Object Detection | Open Images unseen classes (non-overlapping) | AR@1000 (Overall) 21 | 11 |
| Multi-label classification | Open Images v4 (test) | Precision (K=10) 35.3 | 10 |
| Attribution Quality Evaluation | Open Images (val) | SIC AUC 0.866 | 10 |
| Multi-label classification | Open Images (val) | mAP 58.11 | 9 |
| Instance Segmentation | Open Images (test) | mAP50 (Constrained Novel) 35.9 | 8 |
| Lossless Image Compression | Open Images (val) | BPSP 2.867 | 7 |
| Visual Question Answering | Open Images cross-task (test) | Accuracy 44.7 | 5 |
| Visual Reasoning | Open Images (test) | Accuracy 85.1 | 5 |
| Scene Graph Detection | Open Images V6 | mR50 40.7 | 5 |
| Text-Label Classification | Open Images 3756 text labels | mAP 82.52 | 4 |
| Multi-label classification | Open Images v6 (test) | mAP 86.8 | 4 |
| Multi-label classification | Open Images v6 | mAP (C) 87.34 | 4 |
| Object Detection | Open Images 2.4K fashion photos V4 (test) | mAP 72.7 | 4 |
| Multi-label Generalized Zero-Shot Classification | Open Images proposed | P@10 33.6 | 3 |
| Multi-label Zero-Shot Classification | Open Images proposed (7186/367) | Precision @ K=33.5 | 3 |
| Controlled Trace Generation | Open Images Localized Narratives | LBM (k=0) 0.212 | 3 |
| Multi-label Classification | Open-Images v4 (7186/400) (unseen) | mAP (Mean Average Precision) 81.4 | 3 |
| Instance Segmentation | Open Images (novel) | mAP50 (Constrained Novel) - | 0 |