| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Object Detection | D3 | Full Score42 | 35 | |
| Classification | D3 | Mean Accuracy76.954 | 30 | |
| Described Object Detection | D3 (Abs) | mAP31.5 | 16 | |
| Described Object Detection | D3 (Pres) | mAP32.9 | 16 | |
| Described Object Detection | D3 (Full) | mAP32.5 | 16 | |
| Described Object Detection | D3 XL | mAP25.4 | 14 | |
| Described Object Detection | D3 L | mAP31.3 | 14 | |
| Described Object Detection | D3 (M) | mAP35.3 | 14 | |
| Described Object Detection | D3 (S) | mAP35.5 | 14 | |
| Regression | D3 | Average Relative MSE0.023 | 11 | |
| Classification | D3 0.15 (test) | Mean Accuracy77.126 | 10 | |
| Diverse Object Detection | D3 (Inter-scenario) | mAP (FULL)5.7 | 10 | |
| Diverse Object Detection | D3 (Intra-scenario) | mAP (FULL)21.6 | 10 | |
| Visual Grounding | D3 (Inter-scenario) | APb (Full)2,100 | 10 | |
| Visual Grounding | D3 Intra-scenario | APb (Full)37.5 | 10 | |
| Aspect-level sentiment classification | D3 | Accuracy81.3 | 9 | |
| Time-Domain Prediction | D3 | NMSE (dB)-15.52 | 6 | |
| Segmentation | D3 trained on D0 (test) | Dice Score94.82 | 5 | |
| Segmentation | D3 evaluated after training on D2 | Dice94.43 | 5 | |
| Knee cartilage segmentation | D3 | Dice93.09 | 5 | |
| Reliability Assessment | D3 (test) | AU-ARC94.56 | 5 | |
| Text-to-image generation | D3 (test) | LCM Text Alignment0.5236 | 5 | |
| Frequency-Domain Prediction | D3 | NMSE (dB)-9.03 | 5 | |
| People counting | D3 unseen environment full dataset (evaluation) | AP55 | 4 | |
| Underwater Image Restoration | D3 | ΔE0027.82 | 4 |