
SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images

About

Remote sensing imagery plays an irreplaceable role in fields such as agriculture, water resources, military, and disaster relief. Pixel-level interpretation is a critical aspect of remote sensing applications; however, a prevalent limitation remains the need for extensive manual annotation. To address this, we introduce open-vocabulary semantic segmentation (OVSS) into the remote sensing context. However, because remote sensing images are sensitive to low-resolution features, the predicted masks exhibit distorted target shapes and ill-fitting boundaries. To tackle this issue, we propose a simple and general upsampler, SimFeatUp, to restore lost spatial information in deep features in a training-free manner. Further, based on the observation that local patch tokens in CLIP respond abnormally to the [CLS] token, we propose a straightforward subtraction operation to alleviate the global bias in patch tokens. Extensive experiments are conducted on 17 remote sensing datasets spanning semantic segmentation, building extraction, road detection, and flood detection tasks. Our method achieves average improvements of 5.8%, 8.2%, 4.0%, and 15.3% over state-of-the-art methods on the four tasks, respectively. All code is released at https://earth-insights.github.io/SegEarth-OV
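
The subtraction step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `subtract_global_bias`, the token layout ([CLS] first, patch tokens after), and the `weight` scaling factor are all assumptions made for illustration. The abstract only states that a subtraction is applied to reduce global bias in patch tokens.

```python
from typing import List

Vector = List[float]


def subtract_global_bias(tokens: List[Vector], weight: float = 1.0) -> List[Vector]:
    """Return patch tokens with a scaled copy of the [CLS] token subtracted.

    Assumptions (not from the source): tokens[0] is the [CLS] token,
    tokens[1:] are the patch tokens, and `weight` is a hypothetical
    knob controlling how much of the global token is removed.
    """
    if not tokens:
        raise ValueError("expected at least a [CLS] token")
    cls_token, patch_tokens = tokens[0], tokens[1:]
    # Element-wise: patch - weight * cls, for every patch token.
    return [
        [p - weight * c for p, c in zip(patch, cls_token)]
        for patch in patch_tokens
    ]


# Toy example: one [CLS] token followed by two 2-dimensional patch tokens.
tokens = [[1.0, 1.0], [2.0, 3.0], [4.0, 5.0]]
debiased = subtract_global_bias(tokens, weight=0.5)
```

In a real pipeline the debiased patch features would then be compared against text embeddings to assign a class per patch; that step is omitted here because the abstract does not describe it.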

Kaiyu Li, Ruixun Liu, Xiangyong Cao, Xueru Bai, Feng Zhou, Deyu Meng, Zhi Wang • 2024

Related benchmarks

Task                                   Dataset                     Result               Rank
Semantic segmentation                  LoveDA                      mIoU 42.63           142
Semantic segmentation                  Vaihingen                   mIoU 40              95
Semantic segmentation                  Potsdam                     mIoU 48.8            73
Semantic segmentation                  iSAID                       mIoU 21.7            68
Open Vocabulary Semantic Segmentation  COCOStuff (val)             mIoU 25.1            60
Open Vocabulary Semantic Segmentation  Cityscapes (val)            mIoU 30.7            37
Road Segmentation                      Massachusetts Road Dataset  IoU (Average) 0.172  35
Open Vocabulary Semantic Segmentation  PASCAL Context 59 (val)     mIoU 37.5            32
Semantic segmentation                  VDD                         mIoU 45.3            31
Semantic segmentation                  UAVid                       mIoU 42.5            23
Showing 10 of 44 rows
