Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Small but Mighty: Dynamic Wavelet Expert-Guided Fine-Tuning of Large-Scale Models for Optical Remote Sensing Object Segmentation

About

Accurately localizing and segmenting relevant objects from optical remote sensing images (ORSIs) is critical for advancing remote sensing applications. Existing methods are typically built upon moderate-scale pre-trained models and employ diverse optimization strategies to achieve promising performance under full-parameter fine-tuning. In fact, deeper and larger-scale foundation models can provide stronger support for performance improvement. However, due to their massive number of parameters, directly adopting full-parameter fine-tuning leads to pronounced training difficulties, such as excessive GPU memory consumption and high computational costs, which result in extremely limited exploration of large-scale models in existing works. In this paper, we propose a novel dynamic wavelet expert-guided fine-tuning paradigm with fewer trainable parameters, dubbed WEFT, which efficiently adapts large-scale foundation models to ORSIs segmentation tasks by leveraging the guidance of wavelet experts. Specifically, we introduce a task-specific wavelet expert extractor to model wavelet experts from different perspectives and dynamically regulate their outputs, thereby generating trainable features enriched with task-specific information for subsequent fine-tuning. Furthermore, we construct an expert-guided conditional adapter that first enhances the fine-grained perception of frozen features for specific tasks by injecting trainable features, and then iteratively updates the information of both types of feature, allowing for efficient fine-tuning. Extensive experiments show that our WEFT not only outperforms 21 state-of-the-art (SOTA) methods on three ORSIs datasets, but also achieves optimal results in camouflage, natural, and medical scenarios. The source code is available at: https://github.com/CSYSI/WEFT.

Yanguang Sun, Chao Wang, Jian Yang, Lei Luo• 2026

Related benchmarks

TaskDatasetResultRank
Polyp SegmentationKvasir-SEG 100 random images
Dice Coefficient93.29
27
Object SegmentationORSSD 200 images
mIoU89.64
22
Object SegmentationEORSSD 600 images
mIoU86.21
22
Object SegmentationORSIs-4199 (2199 images)
mIoU79.99
18
Camouflaged Object DetectionCAMO 250 images
mIoU83.08
7
Camouflaged Object DetectionCOD10K 2026 images
mIoU79.84
7
Camouflaged Object DetectionNC4K
mIoU83.62
7
Polyp SegmentationCVC-300 62 images
mIoU85.02
7
Salient Object DetectionPASCAL-S 850 images
mIoU83.59
7
Showing 9 of 9 rows

Other info

Follow for update