Deep Perceptual Mapping for Cross-Modal Face Recognition
About
Cross modal face matching between the thermal and visible spectrum is a much desired capability for night-time surveillance and security applications. Due to a very large modality gap, thermal-to-visible face recognition is one of the most challenging face matching problem. In this paper, we present an approach to bridge this modality gap by a significant margin. Our approach captures the highly non-linear relationship between the two modalities by using a deep neural network. Our model attempts to learn a non-linear mapping from visible to thermal spectrum while preserving the identity information. We show substantive performance improvement on three difficult thermal-visible face datasets. The presented approach improves the state-of-the-art by more than 10\% on UND-X1 dataset and by more than 15-30\% on NVESD dataset in terms of Rank-1 identification. Our method bridges the drop in performance due to the modality gap by more than 40\%.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Cross-Modal Rank-1 Identification | UND X1 | Rank-1 Identification Accuracy83.73 | 21 | |
| Face Identification | Carl database | Rank-1 Acc71 | 12 | |
| Face Identification | NVESD LWIR-Visible LWIR probe images vs 2 visible image/subject gallery | Rank-1 Accuracy98 | 12 | |
| Rank-1 Identification | NVESD MWIR-Visible (test) | Rank-1 Identification Accuracy100 | 12 | |
| Thermal-to-visible Face Recognition | Carl (1/subject gallery) | Rank-1 Accuracy56.33 | 8 | |
| Thermal-to-visible Face Recognition | Carl 2 subject gallery | Rank-1 Acc60.08 | 8 | |
| Thermal-to-visible Face Recognition | Carl (all subject gallery) | Rank-1 Acc71 | 8 | |
| Thermal-to-visible Face Recognition | UND-X1 (2/subject) | Rank-1 Accuracy60.83 | 5 | |
| Thermal-to-visible Face Recognition | UND-X1 (all/subject) | Rank-1 Accuracy83.73 | 5 |