OP-Align: Object-level and Part-level Alignment for Self-supervised Category-level Articulated Object Pose Estimation

About

Category-level articulated object pose estimation focuses on the pose estimation of unknown articulated objects within known categories. Despite its significance, this task remains challenging due to the varying shapes and poses of objects, expensive dataset annotation costs, and complex real-world environments. In this paper, we propose a novel self-supervised approach that leverages a single-frame point cloud to solve this task. Our model consistently generates reconstruction with a canonical pose and joint state for the entire input object, and it estimates object-level poses that reduce overall pose variance and part-level poses that align each part of the input with its corresponding part of the reconstruction. Experimental results demonstrate that our approach significantly outperforms previous self-supervised methods and is comparable to the state-of-the-art supervised methods. To assess the performance of our model in real-world scenarios, we also introduce a new real-world articulated object benchmark dataset.

Yuchen Che, Ryo Furukawa, Asako Kanezaki• 2024

Related benchmarks

Task	Dataset	Result
Articulated Joint Estimation	Real-world dataset	mAP (5° 5cm)91.17	24
Part Segmentation	Real-world dataset	mIoU (75%)97.82	24
Part-level Pose Estimation	Real-world dataset	mAP (5° 5cm)36.17	24
Part Pose Estimation	HOI4D synthetic (test)	Mean Part Rotation Error (R)2.87	18
Joint parameter estimation	HOI4D synthetic (test)	Joint Direction Error (D)1.34	15
Part Segmentation	HOI4D synthetic (test)	Part Segmentation IoU92.38	15
Part Pose Estimation	Shape2Motion synthetic (test)	Mean Part Rotation Error (R)5.76	12
Joint parameter estimation	Shape2Motion synthetic (test)	Joint Direction Error (D)4.41	10
Part Segmentation	Shape2Motion synthetic (test)	Segmentation IoU (Part)90.84	10

Showing 9 of 9 rows

Other info

Follow for update

@wizwand_team Discord