Scalable Model Compression by Entropy Penalized Reparameterization

Oktay, Deniz; Ballé, Johannes; Singh, Saurabh; Shrivastava, Abhinav

Computer Science > Machine Learning

arXiv:1906.06624 (cs)

[Submitted on 15 Jun 2019 (v1), last revised 16 Feb 2020 (this version, v3)]

Title:Scalable Model Compression by Entropy Penalized Reparameterization

Authors:Deniz Oktay, Johannes Ballé, Saurabh Singh, Abhinav Shrivastava

View PDF

Abstract:We describe a simple and general neural network weight compression approach, in which the network parameters (weights and biases) are represented in a "latent" space, amounting to a reparameterization. This space is equipped with a learned probability model, which is used to impose an entropy penalty on the parameter representation during training, and to compress the representation using a simple arithmetic coder after training. Classification accuracy and model compressibility is maximized jointly, with the bitrate--accuracy trade-off specified by a hyperparameter. We evaluate the method on the MNIST, CIFAR-10 and ImageNet classification benchmarks using six distinct model architectures. Our results show that state-of-the-art model compression can be achieved in a scalable and general way without requiring complex procedures such as multi-stage training.

Comments:	Published in ICLR 2020
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
Cite as:	arXiv:1906.06624 [cs.LG]
	(or arXiv:1906.06624v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1906.06624

Submission history

From: Deniz Oktay [view email]
[v1] Sat, 15 Jun 2019 22:46:33 UTC (452 KB)
[v2] Fri, 1 Nov 2019 19:52:15 UTC (456 KB)
[v3] Sun, 16 Feb 2020 17:51:13 UTC (52 KB)

Computer Science > Machine Learning

Title:Scalable Model Compression by Entropy Penalized Reparameterization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Scalable Model Compression by Entropy Penalized Reparameterization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators