Understanding Neural Networks and Individual Neuron Importance via Information-Ordered Cumulative Ablation

Amjad, Rana Ali; Liu, Kairen; Geiger, Bernhard C.

doi:10.1109/TNNLS.2021.3088685

Computer Science > Machine Learning

arXiv:1804.06679 (cs)

[Submitted on 18 Apr 2018 (v1), last revised 9 Jun 2021 (this version, v4)]

Title:Understanding Neural Networks and Individual Neuron Importance via Information-Ordered Cumulative Ablation

Authors:Rana Ali Amjad, Kairen Liu, Bernhard C. Geiger

View PDF

Abstract:In this work, we investigate the use of three information-theoretic quantities -- entropy, mutual information with the class variable, and a class selectivity measure based on Kullback-Leibler divergence -- to understand and study the behavior of already trained fully-connected feed-forward neural networks. We analyze the connection between these information-theoretic quantities and classification performance on the test set by cumulatively ablating neurons in networks trained on MNIST, FashionMNIST, and CIFAR-10. Our results parallel those recently published by Morcos et al., indicating that class selectivity is not a good indicator for classification performance. However, looking at individual layers separately, both mutual information and class selectivity are positively correlated with classification performance, at least for networks with ReLU activation functions. We provide explanations for this phenomenon and conclude that it is ill-advised to compare the proposed information-theoretic quantities across layers. Furthermore, we show that cumulative ablation of neurons with ascending or descending information-theoretic quantities can be used to formulate hypotheses regarding the joint behavior of multiple neurons, such as redundancy and synergy, with comparably low computational cost. We also draw connections to the information bottleneck theory for neural networks.

Comments:	12 pages; accepted for publication in IEEE Transactions on Neural Networks and Learning Systems
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT); Machine Learning (stat.ML)
Cite as:	arXiv:1804.06679 [cs.LG]
	(or arXiv:1804.06679v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1804.06679
Journal reference:	IEEE Trans. Neural Networks and Learning Systems 33(12):7842-7852
Related DOI:	https://doi.org/10.1109/TNNLS.2021.3088685

Submission history

From: Bernhard C. Geiger [view email]
[v1] Wed, 18 Apr 2018 12:29:24 UTC (241 KB)
[v2] Thu, 11 Apr 2019 11:35:26 UTC (1,026 KB)
[v3] Wed, 3 Jul 2019 06:52:01 UTC (1,026 KB)
[v4] Wed, 9 Jun 2021 15:28:37 UTC (1,454 KB)

Computer Science > Machine Learning

Title:Understanding Neural Networks and Individual Neuron Importance via Information-Ordered Cumulative Ablation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Understanding Neural Networks and Individual Neuron Importance via Information-Ordered Cumulative Ablation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators