Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning Unorthogonalized Matrices for Rotation Estimation

About

Estimating 3D rotations is a common procedure for 3D computer vision. The accuracy depends heavily on the rotation representation. One form of representation -- rotation matrices -- is popular due to its continuity, especially for pose estimation tasks. The learning process usually incorporates orthogonalization to ensure orthonormal matrices. Our work reveals, through gradient analysis, that common orthogonalization procedures based on the Gram-Schmidt process and singular value decomposition will slow down training efficiency. To this end, we advocate removing orthogonalization from the learning process and learning unorthogonalized `Pseudo' Rotation Matrices (PRoM). An optimization analysis shows that PRoM converges faster and to a better solution. By replacing the orthogonalization incorporated representation with our proposed PRoM in various rotation-related tasks, we achieve state-of-the-art results on large-scale benchmarks for human pose estimation.

Kerui Gu, Zhihao Li, Shiyong Liu, Jianzhuang Liu, Songcen Xu, Youliang Yan, Michael Bi Mi, Kenji Kawaguchi, Angela Yao• 2023

Related benchmarks

TaskDatasetResultRank
3D Human Pose Estimation3DPW (test)
PA-MPJPE29.9
505
3D Human Pose EstimationHuman3.6M
MPJPE43.8
160
3D Human Pose Estimation3DPW
PA-MPJPE42
119
3D Hand Pose EstimationFreiHAND
PA-MPJPE (mm)6.5
24
3D Human Pose and Shape EstimationAGORA
MPJPE65
18
Point cloud pose estimationSynthetic rotation dataset (test)
Mean Error2.13
5
Rotation recoverySynthetic rotation dataset
Mean Angular Error0.37
5
Showing 7 of 7 rows

Other info

Follow for update