Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SCAPO: Self-Supervised Category-Level Articulated Pose Estimation from a Single 3D Observation

About

Existing methods for category-level object articulation from a single 3D observation often rely on dense supervision, multi-frame inputs, or CAD templates, and still struggle to disentangle geometry from articulation or to recover explicit joint parameters. We propose SCAPO, a self-supervised framework that estimates canonical geometry, rigid part segmentation, and joint pivots, axes, and articulation states from a single RGB-D observation without ground-truth labels or category-specific models. Our SCAPO first uses an SE(3)-equivariant vector-neuron autoencoder to factor out global pose and align diverse instances into a shared canonical space. On this aligned shape, a joint-aware blend-skinning module is then designed to model part motion. We learn this representation through cycle reconstruction between observed and canonical shapes and cross-space alignment with a learnable canonical template that decouples shared category geometry from instance-specific residual shape. Experiments on synthetic and real articulated-object datasets show that our SCAPO recovers consistent part structure and accurate articulation parameters and outperforms all self-supervised baselines.

Can Zhang, Gim Hee Lee• 2026

Related benchmarks

TaskDatasetResultRank
Articulated Joint EstimationReal-world dataset
mAP (5° 5cm)92.23
24
Part SegmentationReal-world dataset
mIoU (75%)98.79
24
Part-level Pose EstimationReal-world dataset
mAP (5° 5cm)37.38
24
Part Pose EstimationHOI4D synthetic (test)
Mean Part Rotation Error (R)2.17
18
Joint parameter estimationHOI4D synthetic (test)
Joint Direction Error (D)1.22
15
Part SegmentationHOI4D synthetic (test)
Part Segmentation IoU93.41
15
Part Pose EstimationShape2Motion synthetic (test)
Mean Part Rotation Error (R)5.6
12
Joint parameter estimationShape2Motion synthetic (test)
Joint Direction Error (D)4.7
10
Part SegmentationShape2Motion synthetic (test)
Segmentation IoU (Part)93.1
10
Showing 9 of 9 rows

Other info

Follow for update