Human-Aided Saliency Maps Improve Generalization of Deep Learning

About

Deep learning has driven remarkable accuracy increases in many computer vision problems. One ongoing challenge is how to achieve the greatest accuracy in cases where training data is limited. A second ongoing challenge is that trained models oftentimes do not generalize well even to new data that is subjectively similar to the training set. We address these challenges in a novel way, with the first-ever (to our knowledge) exploration of encoding human judgement about salient regions of images into the training data. We compare the accuracy and generalization of a state-of-the-art deep learning algorithm for a difficult problem in biometric presentation attack detection when trained on (a) original images with typical data augmentations, and (b) the same original images transformed to encode human judgement about salient image regions. The latter approach results in models that achieve higher accuracy and better generalization, decreasing the error of the LivDet-Iris 2020 winner from 29.78% to 16.37%, and achieving impressive generalization in a leave-one-attack-type-out evaluation scenario. This work opens a new area of study for how to embed human intelligence into training strategies for deep learning to achieve high accuracy and generalization in cases of limited training data.

Aidan Boyd, Kevin Bowyer, Adam Czajka• 2021

Related benchmarks

Task	Dataset	Result
Binary Iris Presentation Attack Detection	Iris PAD (test)	MSE0.062	18
Iris Presentation Attack Detection	Iris Presentation Attack Detection (PAD) Open-set (test)	Performance Score (Printout)-0.0184	12
Presentation Attack Detection	Iris Presentation Attack Detection Open-set (test)	Printout Score0.0879	12

Showing 3 of 3 rows

Other info

Follow for update

@wizwand_team Discord