Maximizing Self-supervision from Thermal Image for Effective Self-supervised Learning of Depth and Ego-motion
About
Recently, self-supervised learning of depth and ego-motion from thermal images shows strong robustness and reliability under challenging scenarios. However, the inherent thermal image properties such as weak contrast, blurry edges, and noise hinder to generate effective self-supervision from thermal images. Therefore, most research relies on additional self-supervision sources such as well-lit RGB images, generative models, and Lidar information. In this paper, we conduct an in-depth analysis of thermal image characteristics that degenerates self-supervision from thermal images. Based on the analysis, we propose an effective thermal image mapping method that significantly increases image information, such as overall structure, contrast, and details, while preserving temporal consistency. The proposed method shows outperformed depth and pose results than previous state-of-the-art networks without leveraging additional RGB guidance.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Monocular Visual Odometry | VIVID Mean over sequences | ATE RMSE1.07 | 20 | |
| Monocular Visual Odometry | VIVID in_rob_local | ATE RMSE0.29 | 18 | |
| Monocular Visual Odometry | VIVID in_rob_global | ATE RMSE0.27 | 17 | |
| Monocular Visual Odometry | VIVID in_unst_local | ATE RMSE0.38 | 17 | |
| Monocular Visual Odometry | VIVID in_rob_dark | ATE RMSE0.42 | 16 | |
| Monocular Visual Odometry | VIVID in_unst_global | ATE RMSE0.54 | 15 | |
| Monocular Visual Odometry | VIVID in_agg_global | ATE RMSE0.65 | 14 | |
| Monocular Visual Odometry | VIVID in_unst_dark | ATE RMSE0.38 | 13 | |
| Monocular Visual Odometry | VIVID in_agg_dark | ATE RMSE0.7 | 12 | |
| Monocular Odometry | RRXIO Visual 1.0 | Error (Mocap Easy)1.53 | 10 |