Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

KAN-SAM: Kolmogorov-Arnold Network Guided Segment Anything Model for RGB-T Salient Object Detection

About

Existing RGB-thermal salient object detection (RGB-T SOD) methods aim to identify visually significant objects by leveraging both RGB and thermal modalities to enable robust performance in complex scenarios, but they often suffer from limited generalization due to the constrained diversity of available datasets and the inefficiencies in constructing multi-modal representations. In this paper, we propose a novel prompt learning-based RGB-T SOD method, named KAN-SAM, which reveals the potential of visual foundational models for RGB-T SOD tasks. Specifically, we extend Segment Anything Model 2 (SAM2) for RGB-T SOD by introducing thermal features as guiding prompts through efficient and accurate Kolmogorov-Arnold Network (KAN) adapters, which effectively enhance RGB representations and improve robustness. Furthermore, we introduce a mutually exclusive random masking strategy to reduce reliance on RGB data and improve generalization. Experimental results on benchmarks demonstrate superior performance over the state-of-the-art methods.

Xingyuan Li, Ruichao Hou, Tongwei Ren, Gangshan Wu• 2025

Related benchmarks

TaskDatasetResultRank
RGB-D Video Salient Object DetectionDVisal
S_alpha83.5
14
RGB-D Video Salient Object DetectionViDSOD-100
S_alpha89.2
14
RGB-D Video Salient Object DetectionRDVS
S_alpha85.4
14
Showing 3 of 3 rows

Other info

Follow for update