On the Continuity of Rotation Representations in Neural Networks
About
In neural networks, it is often desirable to work with various representations of the same space. For example, 3D rotations can be represented with quaternions or Euler angles. In this paper, we advance a definition of a continuous representation, which can be helpful for training deep neural networks. We relate this to topological concepts such as homeomorphism and embedding. We then investigate what are continuous and discontinuous representations for 2D, 3D, and n-dimensional rotations. We demonstrate that for 3D rotations, all representations are discontinuous in the real Euclidean spaces of four or fewer dimensions. Thus, widely used representations such as quaternions and Euler angles are discontinuous and difficult for neural networks to learn. We show that the 3D rotations have continuous representations in 5D and 6D, which are more suitable for learning. We also present continuous representations for the general case of the n-dimensional rotation group SO(n). While our main focus is on rotations, we also show that our constructions apply to other groups such as the orthogonal group and similarity transforms. We finally present empirical results, which show that our continuous rotation representations outperform discontinuous ones for several practical problems in graphics and vision, including a simple autoencoder sanity test, a rotation estimator for 3D point clouds, and an inverse kinematics solver for 3D human poses.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Human Pose Estimation | MPI-INF-3DHP (test) | PCK75.3 | 559 | |
| Rotation Estimation | SUN360 Large Overlap | Geodesic Error (Mean)7.18 | 13 | |
| Relative Camera Pose Estimation | Matterport-A (test) | R Mean Error (°)5.73 | 10 | |
| Relative Camera Pose Estimation | Matterport-B (test) | Rotation Mean Error (deg)18.23 | 10 | |
| Rotation Prediction | ModelNet10-SO(3) (test) | Avg Rotation Error41.1 | 9 | |
| Rotation Estimation | StreetLearn Large Overlap | Mean Geodesic Error3.36 | 6 | |
| Rotation Estimation | StreetLearn-T Large Overlap | Mean Geodesic Error12.31 | 6 | |
| Rotation Estimation | InteriorNet Large Overlap | Mean Geodesic Error5.43 | 6 | |
| Rotation Estimation | InteriorNet Small Overlap | Mean Geodesic Error17.83 | 6 | |
| Rotation Estimation | InteriorNet-T Large Overlap | Mean Geodesic Error10.45 | 6 |