CTRL-C: Camera calibration TRansformer with Line-Classification

About

Single image camera calibration is the task of estimating the camera parameters from a single input image, such as the vanishing points, focal length, and horizon line. In this work, we propose Camera calibration TRansformer with Line-Classification (CTRL-C), an end-to-end neural network-based approach to single image camera calibration, which directly estimates the camera parameters from an image and a set of line segments. Our network adopts the transformer architecture to capture the global structure of an image with multi-modal inputs in an end-to-end manner. We also propose an auxiliary task of line classification to train the network to extract the global geometric information from lines effectively. Our experiments demonstrate that CTRL-C outperforms the previous state-of-the-art methods on the Google Street View and SUN360 benchmark datasets.

Jinwoo Lee, Hyunsung Go, Hyunjoon Lee, Sunghyun Cho, Minhyuk Sung, Junho Kim• 2021

Related benchmarks

Task	Dataset	Result
Camera Understanding	MegaDepth	FoV AUC@1°2	31
Camera Understanding	LaMAR	FoV AUC@1°9.8	26
Camera Understanding	TartanAir	FoV AUC@1°10.7	26
Camera Understanding	Stanford2D3D	FoV AUC (Threshold 1°)7.7	26
Perspective Field prediction	Stanford2D3D (test)	Up Mean7.39	12
Perspective Field prediction	TartanAir (test)	Mean Angular Error (Up)7.32	12
Camera Understanding	Puffin Und	Roll Error4.69	7
Camera Parameter Estimation	GSV uncentered (test)	Roll Mean Error1.92	6
Monocular Camera Calibration	GSV dataset (test)	Mean FoV Error (°)3.59	6
Object-centric prediction	Objectron 1.0 (isolated)	Up Mean Error7.49	5

Showing 10 of 12 rows

Other info

Follow for update

@wizwand_team Discord