Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Explicit Visual Prompting for Low-Level Structure Segmentations

About

We consider the generic problem of detecting low-level structures in images, which includes segmenting the manipulated parts, identifying out-of-focus pixels, separating shadow regions, and detecting concealed objects. Whereas each such topic has been typically addressed with a domain-specific solution, we show that a unified approach performs well across all of them. We take inspiration from the widely-used pre-training and then prompt tuning protocols in NLP and propose a new visual prompting model, named Explicit Visual Prompting (EVP). Different from the previous visual prompting which is typically a dataset-level implicit embedding, our key insight is to enforce the tunable parameters focusing on the explicit visual content from each individual image, i.e., the features from frozen patch embeddings and the input's high-frequency components. The proposed EVP significantly outperforms other parameter-efficient tuning protocols under the same amount of tunable parameters (5.7% extra trainable parameters of each task). EVP also achieves state-of-the-art performances on diverse low-level structure segmentation tasks compared to task-specific solutions. Our code is available at: https://github.com/NiFangBaAGe/Explicit-Visual-Prompt.

Weihuang Liu, Xi Shen, Chi-Man Pun, Xiaodong Cun• 2023

Related benchmarks

TaskDatasetResultRank
Salient Object DetectionDUT-OMRON (test)
MAE0.046
92
Salient Object DetectionECSSD 1,000 images (test)
MAE0.027
48
Salient Object DetectionHKU-IS 4,447 images (test)
MAE0.023
42
Co-Saliency DetectionCoSOD3k (test)
Fmax0.813
41
Salient Object DetectionDUTS 5019 (test)
Mean Absolute Error (MAE)0.026
29
Co-Salient Object DetectionCoCA (test)
Fmax0.546
28
Co-Salient Object DetectionCoSal 2015 (test)
Sm87.6
23
Defocus Blur DetectionCUHK 100 images (test)
F-beta Score0.932
8
Defocus Blur DetectionDUT 500 images (test)
F_beta0.89
8
Forgery DetectionCASIA 1.0
F1 Score0.654
8
Showing 10 of 12 rows

Other info

Code

Follow for update