Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

End-to-end weakly-supervised semantic alignment

About

We tackle the task of semantic alignment where the goal is to compute dense semantic correspondence aligning two images depicting objects of the same category. This is a challenging task due to large intra-class variation, changes in viewpoint and background clutter. We present the following three principal contributions. First, we develop a convolutional neural network architecture for semantic alignment that is trainable in an end-to-end manner from weak image-level supervision in the form of matching image pairs. The outcome is that parameters are learnt from rich appearance variation present in different but semantically related images without the need for tedious manual annotation of correspondences at training time. Second, the main component of this architecture is a differentiable soft inlier scoring module, inspired by the RANSAC inlier scoring procedure, that computes the quality of the alignment based on only geometrically consistent correspondences thereby reducing the effect of background clutter. Third, we demonstrate that the proposed approach achieves state-of-the-art performance on multiple standard benchmarks for semantic alignment.

Ignacio Rocco, Relja Arandjelovi\'c, Josef Sivic• 2017

Related benchmarks

TaskDatasetResultRank
Semantic CorrespondenceSPair-71k (test)
PCK@0.120.9
122
Semantic CorrespondencePF-WILLOW
PCK@0.1 (bbox)70.2
109
Semantic CorrespondencePF-Pascal (test)
PCK@0.174.8
106
Semantic CorrespondencePF-PASCAL
PCK @ alpha=0.174.8
98
Keypoint TransferSPair-71k (test)
Bicycle17.6
38
Semantic CorrespondencePF-WILLOW (test)--
37
Semantic keypoint transferPF-Pascal (test)
PCK @ 0.0549
35
Semantic CorrespondenceCaltech-101
LT-ACC85
31
Semantic CorrespondencePF-PASCAL
PCK@0.174.8
29
Semantic CorrespondenceSPair-71k
PCK@0.121.1
24
Showing 10 of 13 rows

Other info

Follow for update