| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Classification | NWPU (test) | OA0.9506 | 80 | |
| Few-Shot Classification | NWPU | Accuracy93.25 | 54 | |
| Scene Classification | NWPU 20% training ratio 45 classes (test) | Overall Accuracy95.82 | 45 | |
| Scene Classification | NWPU | Top-1 Acc96.92 | 38 | |
| Scene Classification | NWPU 10/90 split | Accuracy94.56 | 21 | |
| Crowd Counting | NWPU (test) | MAE46.23 | 15 | |
| Crowd Counting | NWPU 49 | MAE71.7 | 13 | |
| Object Detection | NWPU | mAP5093.3 | 12 | |
| Object Detection | NWPU-10 (test) | Recall @ IoU=0.591.7 | 10 | |
| Crowd Counting | NWPU | MAE46.23 | 9 | |
| Crowd Counting | NWPU (val) | MAE70.5 | 8 | |
| Scene Classification | NWPU RESISC45 (test) | Top-1 Accuracy96.17 | 6 | |
| Image Segmentation | NWPU | mIoU84 | 6 | |
| Question Generation | NWPU-300 (test) | BLEU-141.87 | 5 | |
| Dataset Distillation | NWPU | Training Time (h)13.17 | 5 | |
| Image Captioning | NWPU | BLEU-190.9 | 4 | |
| Text-to-Image Retrieval | NWPU | Avg R@1/5/1037.74 | 4 | |
| Image-to-Text Retrieval | NWPU | Avg Top-1/5/1045.12 | 4 |