Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

SD4Match: Learning to Prompt Stable Diffusion Model for Semantic Matching

About

In this paper, we address the challenge of matching semantically similar keypoints across image pairs. Existing research indicates that the intermediate output of the UNet within the Stable Diffusion (SD) can serve as robust image feature maps for such a matching task. We demonstrate that by employing a basic prompt tuning technique, the inherent potential of Stable Diffusion can be harnessed, resulting in a significant enhancement in accuracy over previous approaches. We further introduce a novel conditional prompting module that conditions the prompt on the local details of the input image pairs, leading to a further improvement in performance. We designate our approach as SD4Match, short for Stable Diffusion for Semantic Matching. Comprehensive evaluations of SD4Match on the PF-Pascal, PF-Willow, and SPair-71k datasets show that it sets new benchmarks in accuracy across all these datasets. Particularly, SD4Match outperforms the previous state-of-the-art by a margin of 12 percentage points on the challenging SPair-71k dataset.

Xinghui Li, Jingyi Lu, Kai Han, Victor Prisacariu• 2023

Related benchmarks

TaskDatasetResultRank
Semantic CorrespondenceSPair-71k (test)
PCK@0.175.5
122
Semantic CorrespondencePF-WILLOW
PCK@0.1 (bbox)81.3
109
Semantic CorrespondencePF-Pascal (test)
PCK@0.195.2
106
Semantic CorrespondencePF-PASCAL
PCK @ alpha=0.195.2
98
Semantic CorrespondenceSPair-71k
Φ_bbox @ α=0.175.5
29
Semantic MatchingSPair-71k 1.0 (test)
PCK@0.1 (Aero)75.3
16
Semantic MatchingSPair-71k
PCK@0.0559.5
14
Semantic MatchingSPair-71k
PCK @ alpha_bbox (0.1)75.5
9
Semantic CorrespondenceSPair-71k (ablation)
Phi (bbox, alpha=0.1)75.5
6
Showing 9 of 9 rows

Other info

Follow for update