Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Spherical Regression: Learning Viewpoints, Surface Normals and 3D Rotations on n-Spheres

About

Many computer vision challenges require continuous outputs, but tend to be solved by discrete classification. The reason is classification's natural containment within a probability $n$-simplex, as defined by the popular softmax activation function. Regular regression lacks such a closed geometry, leading to unstable training and convergence to suboptimal local minima. Starting from this insight we revisit regression in convolutional neural networks. We observe many continuous output problems in computer vision are naturally contained in closed geometrical manifolds, like the Euler angles in viewpoint estimation or the normals in surface normal estimation. A natural framework for posing such continuous output problems are $n$-spheres, which are naturally closed geometric manifolds defined in the $\mathbb{R}^{(n+1)}$ space. By introducing a spherical exponential mapping on $n$-spheres at the regression output, we obtain well-behaved gradients, leading to stable training. We show how our spherical regression can be utilized for several computer vision challenges, specifically viewpoint estimation, surface normal estimation and 3D rotation estimation. For all these problems our experiments demonstrate the benefit of spherical regression. All paper resources are available at https://github.com/leoshine/Spherical_Regression.

Shuai Liao, Efstratios Gavves, Cees G. M. Snoek• 2019

Related benchmarks

TaskDatasetResultRank
Viewpoint EstimationPascal3D+ Car (test)
Median Error12.3
12
Pose EstimationPascal3D+ chair (test)
Median Angular Error (°)30.8
12
Relative Camera Pose EstimationMatterport-A (test)
R Mean Error (°)10.68
10
Relative Camera Pose EstimationMatterport-B (test)
Rotation Mean Error (deg)32.94
10
Rotation PredictionPASCAL3D+ (test)
Average Rotation Error13
10
Rotation PredictionModelNet10-SO(3) (test)
Avg Rotation Error36.5
9
Showing 6 of 6 rows

Other info

Follow for update