Maximizing Self-supervision from Thermal Image for Effective Self-supervised Learning of Depth and Ego-motion

About

Recently, self-supervised learning of depth and ego-motion from thermal images shows strong robustness and reliability under challenging scenarios. However, the inherent thermal image properties such as weak contrast, blurry edges, and noise hinder to generate effective self-supervision from thermal images. Therefore, most research relies on additional self-supervision sources such as well-lit RGB images, generative models, and Lidar information. In this paper, we conduct an in-depth analysis of thermal image characteristics that degenerates self-supervision from thermal images. Based on the analysis, we propose an effective thermal image mapping method that significantly increases image information, such as overall structure, contrast, and details, while preserving temporal consistency. The proposed method shows outperformed depth and pose results than previous state-of-the-art networks without leveraging additional RGB guidance.

Ukcheol Shin, Kyunghyun Lee, Byeong-Uk Lee, In So Kweon• 2022

Related benchmarks

Task	Dataset	Result
Monocular Visual Odometry	VIVID Mean over sequences	ATE RMSE1.07	20
Monocular Visual Odometry	VIVID in_rob_local	ATE RMSE0.29	18
Monocular Visual Odometry	VIVID in_rob_global	ATE RMSE0.27	17
Monocular Visual Odometry	VIVID in_unst_local	ATE RMSE0.38	17
Monocular Visual Odometry	VIVID in_rob_dark	ATE RMSE0.42	16
Monocular Visual Odometry	VIVID in_unst_global	ATE RMSE0.54	15
Monocular Visual Odometry	VIVID in_agg_global	ATE RMSE0.65	14
Monocular Visual Odometry	VIVID in_unst_dark	ATE RMSE0.38	13
Monocular Visual Odometry	VIVID in_agg_dark	ATE RMSE0.7	12
Monocular Odometry	RRXIO Visual 1.0	Error (Mocap Easy)1.53	10

Showing 10 of 11 rows

Other info

Follow for update

@wizwand_team Discord