Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Balanced Diffusion-Guided Fusion for Multimodal Remote Sensing Classification

About

Deep learning-based techniques for the analysis of multimodal remote sensing data have become popular due to their ability to effectively integrate complementary spatial, spectral, and structural information from different sensors. Recently, denoising diffusion probabilistic models (DDPMs) have attracted attention in the remote sensing community due to their powerful ability to capture robust and complex spatial-spectral distributions. However, pre-training multimodal DDPMs may result in modality imbalance, and effectively leveraging diffusion features to guide complementary diversity feature extraction remains an open question. To address these issues, this paper proposes a balanced diffusion-guided fusion (BDGF) framework that leverages multimodal diffusion features to guide a multi-branch network for land-cover classification. Specifically, we propose an adaptive modality masking strategy to encourage the DDPMs to obtain a modality-balanced rather than spectral image-dominated data distribution. Subsequently, these diffusion features hierarchically guide feature extraction among CNN, Mamba, and transformer networks by integrating feature fusion, group channel attention, and cross-attention mechanisms. Finally, a mutual learning strategy is developed to enhance inter-branch collaboration by aligning the probability entropy and feature similarity of individual subnetworks. Extensive experiments on four multimodal remote sensing datasets demonstrate that the proposed method achieves superior classification performance. The code is available at https://github.com/HaoLiu-XDU/BDGF.

Hao Liu, Yongjie Zheng, Yuhan Kang, Mingyang Zhang, Maoguo Gong, Lorenzo Bruzzone• 2025

Related benchmarks

TaskDatasetResultRank
Remote Sensing Image ClassificationAugsburg
Parameters (M)4.3
20
Remote Sensing Image ClassificationYellow River Estuary
Params (M)4.3
20
Remote Sensing Image ClassificationLCZ HK
Params (M)4.3
20
Multimodal Remote Sensing ClassificationYellow River Estuary
Overall Accuracy (OA)79.55
12
Remote Sensing Image ClassificationBerlin
Model Parameters (M)4.3
12
Multimodal Remote Sensing ClassificationLCZ HK 50 samples per class (train)
Class 1 Accuracy80.77
10
Multimodal Remote Sensing ClassificationAugsburg HSI+SAR (test)
Class Accuracy 197.03
10
Multimodal Remote Sensing ClassificationBerlin 100 samples per class (train)
Class 1 Accuracy82.34
10
Showing 8 of 8 rows

Other info

Follow for update