SFNet: Learning Object-aware Semantic Correspondence

About

We address the problem of semantic correspondence, that is, establishing a dense flow field between images depicting different instances of the same object or scene category. We propose to use images annotated with binary foreground masks and subjected to synthetic geometric deformations to train a convolutional neural network (CNN) for this task. Using these masks as part of the supervisory signal offers a good compromise between semantic flow methods, where the amount of training data is limited by the cost of manually selecting point correspondences, and semantic alignment ones, where the regression of a single global geometric transformation between images may be sensitive to image-specific details such as background clutter. We propose a new CNN architecture, dubbed SFNet, which implements this idea. It leverages a new and differentiable version of the argmax function for end-to-end training, with a loss that combines mask and flow consistency with smoothness terms. Experimental results demonstrate the effectiveness of our approach, which significantly outperforms the state of the art on standard benchmarks.

Junghyup Lee, Dohyung Kim, Jean Ponce, Bumsub Ham• 2019

Related benchmarks

Task	Dataset	Result
Semantic Correspondence	SPair-71k (test)	PCK@0.127.9	146
Semantic Correspondence	PF-PASCAL	PCK @ alpha=0.184	116
Semantic Correspondence	PF-WILLOW	PCK@0.1 (bbox)74	109
Semantic Correspondence	PF-Pascal (test)	PCK@0.192.9	106
Semantic Correspondence	PF-WILLOW (test)	PCK @ 0.10 (bbox)72.5	37
Semantic keypoint transfer	PF-Pascal (test)	PCK @ 0.0553.6	35
Semantic Correspondence	Caltech-101	LT-ACC88	31
Semantic Correspondence	SPair-71k	Φ_bbox @ α=0.124	29
Semantic Matching	TSS (test)	FG3DCar PCK@0.0588	27
Semantic Matching	SPair-71k 1.0 (test)	PCK@0.1 (Aero)26.9	16

Showing 10 of 11 rows

Other info

Follow for update

@wizwand_team Discord