PST900: RGB-Thermal Calibration, Dataset and Segmentation Network
About
In this work we propose long wave infrared (LWIR) imagery as a viable supporting modality for semantic segmentation using learning-based techniques. We first address the problem of RGB-thermal camera calibration by proposing a passive calibration target and procedure that is both portable and easy to use. Second, we present PST900, a dataset of 894 synchronized and calibrated RGB and Thermal image pairs with per pixel human annotations across four distinct classes from the DARPA Subterranean Challenge. Lastly, we propose a CNN architecture for fast semantic segmentation that combines both RGB and Thermal imagery in a way that leverages RGB imagery independently. We compare our method against the state-of-the-art and show that our method outperforms them in our dataset.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Semantic segmentation | MFNet (test) | mIoU48.4 | 134 | |
| Semantic segmentation | PST900 (test) | mIoU68.4 | 72 | |
| Semantic segmentation | PST900 | mIoU68.4 | 30 | |
| Semantic segmentation | MFNet day-night (test) | Car IoU76.8 | 20 | |
| Semantic segmentation | PST900 (evaluation) | Background98.9 | 19 | |
| Semantic segmentation | MFNet 1.0 (test) | Car IoU76.8 | 18 | |
| Semantic segmentation | MFNet Dataset (test) | Background IoU97.01 | 14 | |
| Semantic segmentation | MF day-night 11 (evaluation set) | Unlabeled IoU97 | 12 | |
| Semantic segmentation | MFNet (test) | Params (M)31.04 | 9 | |
| Semantic segmentation | PST900 (val) | mIoU68.4 | 9 |