| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Semantic Segmentation | VOC-20 | mIoU97.7 | 118 | |
| Semantic Segmentation | VOC21 | mIoU84.6 | 108 | |
| Image Classification | VOC 2007 | Top-1 Accuracy92.6 | 85 | |
| Object Detection | VOC 2007 (test) | AP@5085.4 | 84 | |
| Semantic Segmentation | VOC 2012 (val) | mIoU91.9 | 76 | |
| Multi-label Image Classification | VOC 2012 (test) | mAP96.6 | 72 | |
| Object Detection | VOC 2012 (test) | mAP@.5084.5 | 69 | |
| Image Classification | VOC 2007 (test) | mAP91.9 | 67 | |
| Multi-label image recognition | VOC 2007 (test) | mAP96.8 | 61 | |
| Multi-label Classification | VOC 07 | mAP97 | 61 | |
| Semantic Segmentation | VOC | mIoU85.21 | 55 | |
| Semantic Segmentation | VOC 2012 | mIoU97.7 | 52 | |
| Pointing localization | VOC 2007 (test) | Mean Accuracy (All)94.2 | 44 | |
| Object Detection | VOC 07+12 (test) | APall79.3 | 38 | |
| Unsupervised Single Object Discovery | VOC 2012 (test) | CorLoc81.5 | 34 | |
| Unsupervised Single Object Discovery | VOC 2007 (test) | CorLoc77.5 | 34 | |
| Object Detection | VOC7+12 (val) | mAP54 | 33 | |
| Object Detection | VOC to Watercolor (target) | mAP64.2 | 31 | |
| Classification | VOC 2007 | Accuracy95 | 31 | |
| Object Detection | VOC 07+12 | AP@5084.2 | 30 | |
| Object Detection | VOC0712 | AP84.9 | 29 | |
| Classification | VOC07 (test) | Accuracy94 | 29 | |
| Semantic Segmentation | VOC 21 (val) | mIoU76.4 | 28 | |
| Object-Centric Representation Learning | VOC | mBOi58.9 | 28 | |
| Image Classification | VOC 07 | mAP89.3 | 27 |