| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Object Detection | FLIR (test) | mAP502.39 | 94 | |
| Object Detection | FLIR | mAP87 | 65 | |
| Object Detection | FLIR Aligned (test) | mAP@0.587 | 40 | |
| RGB-T Pedestrian Detection Attack | FLIR aligned (test) | ASR100 | 30 | |
| Infrared-Visible Image Fusion | FLIR | AG5.079 | 22 | |
| Object Detection | FLIR Aligned | AP48.1 | 20 | |
| Object Detection | FLIR v2 | AP20 | 15 | |
| Text-to-Image Retrieval | FLIR (test) | Recall@111.7 | 11 | |
| Image-to-Text Retrieval | FLIR (test) | Recall@112.3 | 11 | |
| Image Fusion | FLIR PID-generated infrared | AG (Average Gradient)4.518 | 11 | |
| Object Detection | FLIR relabeled version by Zhang (test) | mAP43.8 | 11 | |
| Object Detection | FLIR v1_3 | ASR20.2 | 9 | |
| Image-to-Image Translation | FLIR | PSNR17.74 | 9 | |
| Infrared and Visible Image Fusion | FLIR image fusion | EN7.344 | 9 | |
| Object Detection | FLIR 10-shot (test) | mAP5071.15 | 8 | |
| Object Detection | FLIR 5-shot (test) | mAP5070.69 | 8 | |
| RGB-to-LWIR translation | FLIR 2024 (test) | PSNR23.45 | 7 | |
| Thermal-to-Text Retrieval (via Vision pivot) | FLIR to LLVIP (test) | mAP45.2 | 6 | |
| Multi-category object detection | FLIR RGB + IR (test) | AP5086.4 | 4 | |
| Infrared Image Classification | FLIR V2 | Top-1 Accuracy97.2 | 4 | |
| Infrared Image Classification | FLIR V1 | Top-1 Accuracy82.9 | 2 | |
| Out-of-Distribution Detection | FLIR VIS | AUROC (Level 1)82.83 | 1 |