Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

This Looks Like That: Deep Learning for Interpretable Image Recognition

About

When we are faced with challenging image classification tasks, we often explain our reasoning by dissecting the image, and pointing out prototypical aspects of one class or another. The mounting evidence for each of the classes helps us make our final decision. In this work, we introduce a deep network architecture -- prototypical part network (ProtoPNet), that reasons in a similar way: the network dissects the image by finding prototypical parts, and combines evidence from the prototypes to make a final classification. The model thus reasons in a way that is qualitatively similar to the way ornithologists, physicians, and others would explain to people on how to solve challenging image classification tasks. The network uses only image-level labels for training without any annotations for parts of images. We demonstrate our method on the CUB-200-2011 dataset and the Stanford Cars dataset. Our experiments show that ProtoPNet can achieve comparable accuracy with its analogous non-interpretable counterpart, and when several ProtoPNets are combined into a larger network, it can achieve an accuracy that is on par with some of the best-performing deep models. Moreover, ProtoPNet provides a level of interpretability that is absent in other interpretable deep models.

Chaofan Chen, Oscar Li, Chaofan Tao, Alina Jade Barnett, Jonathan Su, Cynthia Rudin• 2018

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-10 (test)
Accuracy67.8
3381
Image ClassificationMNIST (test)
Accuracy94.7
894
Image ClassificationStanford Cars
Accuracy84.5
635
Image ClassificationSVHN (test)
Accuracy88.6
401
Image ClassificationCUB-200 2011
Accuracy81.1
356
Fine-grained Image ClassificationCUB-200 2011
Accuracy84
300
Image ClassificationCUB-200-2011 (test)
Top-1 Acc80.1
286
Fine-grained Image ClassificationStanford Cars
Accuracy89.5
284
Image ClassificationMNIST (test)
Accuracy99.4
196
Fine-grained Image ClassificationStanford Dogs (test)
Accuracy77.3
124
Showing 10 of 33 rows

Other info

Follow for update