Neural Network Pruning by Gradient Descent

Zhang, Zhang; Tao, Ruyi; Zhang, Jiang

Computer Science > Machine Learning

arXiv:2311.12526 (cs)

[Submitted on 21 Nov 2023 (v1), last revised 22 Nov 2023 (this version, v2)]

Title:Neural Network Pruning by Gradient Descent

Authors:Zhang Zhang, Ruyi Tao, Jiang Zhang

View PDF

Abstract:The rapid increase in the parameters of deep learning models has led to significant costs, challenging computational efficiency and model interpretability. In this paper, we introduce a novel and straightforward neural network pruning framework that incorporates the Gumbel-Softmax technique. This framework enables the simultaneous optimization of a network's weights and topology in an end-to-end process using stochastic gradient descent. Empirical results demonstrate its exceptional compression capability, maintaining high accuracy on the MNIST dataset with only 0.15\% of the original network parameters. Moreover, our framework enhances neural network interpretability, not only by allowing easy extraction of feature importance directly from the pruned network but also by enabling visualization of feature symmetry and the pathways of information propagation from features to outcomes. Although the pruning strategy is learned through deep learning, it is surprisingly intuitive and understandable, focusing on selecting key representative features and exploiting data patterns to achieve extreme sparse pruning. We believe our method opens a promising new avenue for deep learning pruning and the creation of interpretable machine learning systems.

Comments:	21 pages, 5 figures
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2311.12526 [cs.LG]
	(or arXiv:2311.12526v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2311.12526

Submission history

From: Zhang Zhang [view email]
[v1] Tue, 21 Nov 2023 11:12:03 UTC (2,849 KB)
[v2] Wed, 22 Nov 2023 09:39:02 UTC (2,875 KB)

Computer Science > Machine Learning

Title:Neural Network Pruning by Gradient Descent

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Neural Network Pruning by Gradient Descent

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators