Towards Visual Explanations for Convolutional Neural Networks via Input Resampling

Lengerich, Benjamin J.; Konam, Sandeep; Xing, Eric P.; Rosenthal, Stephanie; Veloso, Manuela

Computer Science > Machine Learning

arXiv:1707.09641 (cs)

[Submitted on 30 Jul 2017 (v1), last revised 16 Aug 2017 (this version, v2)]

Title:Towards Visual Explanations for Convolutional Neural Networks via Input Resampling

Authors:Benjamin J. Lengerich, Sandeep Konam, Eric P. Xing, Stephanie Rosenthal, Manuela Veloso

View PDF

Abstract:The predictive power of neural networks often costs model interpretability. Several techniques have been developed for explaining model outputs in terms of input features; however, it is difficult to translate such interpretations into actionable insight. Here, we propose a framework to analyze predictions in terms of the model's internal features by inspecting information flow through the network. Given a trained network and a test image, we select neurons by two metrics, both measured over a set of images created by perturbations to the input image: (1) magnitude of the correlation between the neuron activation and the network output and (2) precision of the neuron activation. We show that the former metric selects neurons that exert large influence over the network output while the latter metric selects neurons that activate on generalizable features. By comparing the sets of neurons selected by these two metrics, our framework suggests a way to investigate the internal attention mechanisms of convolutional neural networks.

Comments:	Presented at ICML 2017 Workshop on Visualization for Deep Learning
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1707.09641 [cs.LG]
	(or arXiv:1707.09641v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1707.09641

Submission history

From: Benjamin Lengerich [view email]
[v1] Sun, 30 Jul 2017 17:12:20 UTC (926 KB)
[v2] Wed, 16 Aug 2017 14:02:23 UTC (925 KB)

Computer Science > Machine Learning

Title:Towards Visual Explanations for Convolutional Neural Networks via Input Resampling

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Towards Visual Explanations for Convolutional Neural Networks via Input Resampling

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators