Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Brewing Stronger Features: Dual-Teacher Distillation for Multispectral Earth Observation

About

Foundation models are transforming Earth Observation (EO), yet the diversity of EO sensors and modalities makes a single universal model unrealistic. Multiple specialized EO foundation models (EOFMs) will likely coexist, making efficient knowledge transfer across modalities essential. Most existing EO pretraining relies on masked image modeling, which emphasizes local reconstruction but provides limited control over global semantic structure. To address this, we propose a dual-teacher contrastive distillation framework for multispectral imagery that aligns the student's pretraining objective with the contrastive self-distillation paradigm of modern optical vision foundation models (VFMs). Our approach combines a multispectral teacher with an optical VFM teacher, enabling coherent cross-modal representation learning. Experiments across diverse optical and multispectral benchmarks show that our model adapts to multispectral data without compromising performance on optical-only inputs, achieving state-of-the-art results in both settings, with an average improvement of 3.64 percentage points in semantic segmentation, 1.2 in change detection, and 1.31 in classification tasks. This demonstrates that contrastive distillation provides a principled and efficient approach to scalable representation learning across heterogeneous EO data sources. Project page: \textcolor{magenta}{https://wolfilip.github.io/DEO/}.

Filip Wolf, Bla\v{z} Rolih, Luka \v{C}ehovin Zajc• 2026

Related benchmarks

TaskDatasetResultRank
Change DetectionLEVIR
F1 Score91.3
62
Change DetectionOSCD--
26
Semantic segmentationSpaceNet v1
macro mIoU80.83
20
Multi-Label ClassificationGB-BEN
F1 Score51.89
10
Semantic segmentationGEO-Bench SA-c
Macro mIoU28.98
10
Semantic segmentationSen1Floods11
mIoU (macro)94.32
10
Semantic segmentationPASTIS
Macro mIoU28.96
10
Semantic segmentationOptical and Multispectral Segmentation Summary
mIoU (Optical, Macro)81.98
10
Semantic segmentationGeo-Bench
mIoU (nz-cattle, macro)80.21
10
Multispectral ClassificationGEO-Bench m-bigearthnet, m-so2sat, m-eurosat (test)
F1 Score (GB-ben)0.5843
10
Showing 10 of 10 rows

Other info

Follow for update