Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SAMSOD: Rethinking SAM Optimization for RGB-T Salient Object Detection

About

RGB-T salient object detection (SOD) aims to segment attractive objects by combining RGB and thermal infrared images. To enhance performance, the Segment Anything Model has been fine-tuned for this task. However, the imbalance convergence of two modalities and significant gradient difference between high- and low- activations are ignored, thereby leaving room for further performance enhancement. In this paper, we propose a model called \textit{SAMSOD}, which utilizes unimodal supervision to enhance the learning of non-dominant modality and employs gradient deconfliction to reduce the impact of conflicting gradients on model convergence. The method also leverages two decoupled adapters to separately mask high- and low-activation neurons, emphasizing foreground objects by enhancing background learning. Fundamental experiments on RGB-T SOD benchmark datasets and generalizability experiments on scribble supervised RGB-T SOD, fully supervised RGB-D SOD datasets and full-supervised RGB-D rail surface defect detection all demonstrate the effectiveness of our proposed method.

Zhengyi Liu, Xinrui Wang, Xianyong Fang, Zhengzheng Tu, Linbo Wang• 2025

Related benchmarks

TaskDatasetResultRank
RGB-T Salient Object DetectionVT1000
S-Measure (S)94.2
42
RGB-T Salient Object DetectionVT821
S Score0.913
42
RGB-T Salient Object DetectionVT5000 (test)
Sm Score92.3
39
RGB-T Salient Object DetectionVT1000 (test)
S-Measure94.2
39
RGB-T Salient Object DetectionVT821 (test)
Sm0.913
39
RGB-T Salient Object DetectionVT5000
Score (M)2.1
28
Showing 6 of 6 rows

Other info

Follow for update