Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ObjectRelator: Enabling Cross-View Object Relation Understanding Across Ego-Centric and Exo-Centric Perspectives

About

Bridging the gap between ego-centric and exo-centric views has been a long-standing question in computer vision. In this paper, we focus on the emerging Ego-Exo object correspondence task, which aims to understand object relations across ego-exo perspectives through segmentation. While numerous segmentation models have been proposed, most operate on a single image (view), making them impractical for cross-view scenarios. PSALM, a recently proposed segmentation method, stands out as a notable exception with its demonstrated zero-shot ability on this task. However, due to the drastic viewpoint change between ego and exo, PSALM fails to accurately locate and segment objects, especially in complex backgrounds or when object appearances change significantly. To address these issues, we propose ObjectRelator, a novel approach featuring two key modules: Multimodal Condition Fusion (MCFuse) and SSL-based Cross-View Object Alignment (XObjAlign). MCFuse introduces language as an additional cue, integrating both visual masks and textual descriptions to improve object localization and prevent incorrect associations. XObjAlign enforces cross-view consistency through self-supervised alignment, enhancing robustness to object appearance variations. Extensive experiments demonstrate ObjectRelator's effectiveness on the large-scale Ego-Exo4D benchmark and HANDAL-X (an adapted dataset for cross-view segmentation) with state-of-the-art performance. Code is made available at: http://yuqianfu.com/ObjectRelator.

Yuqian Fu, Runze Wang, Bin Ren, Guolei Sun, Biao Gong, Yanwei Fu, Danda Pani Paudel, Xuanjing Huang, Luc Van Gool• 2024

Related benchmarks

TaskDatasetResultRank
Cross-view Instance SegmentationEgo-Exo4D Ego-to-Exo
IoU45.4
15
Cross-view Instance SegmentationEgo-Exo4D Exo-to-Ego
IoU50.9
15
Ego-to-Exo object correspondenceEgo-Exo4D Correspondences v2 (test)
IoU35.3
11
Exo-to-Ego object correspondenceEgo-Exo4D Correspondences v2 (test)
IoU40.3
11
Cross-view Object CorrespondenceEgo-Exo4D v2 (test)--
11
Cross-view Object SegmentationHANDAL-X
IoU42.8
7
Showing 6 of 6 rows

Other info

Follow for update