Robust Training of Neural Networks at Arbitrary Precision and Sparsity

Ye, Chengxi; Chu, Grace; Liu, Yanfeng; Zhang, Yichi; Lew, Lukasz; Howard, Andrew

Computer Science > Machine Learning

arXiv:2409.09245v1 (cs)

[Submitted on 14 Sep 2024]

Title:Robust Training of Neural Networks at Arbitrary Precision and Sparsity

Authors:Chengxi Ye, Grace Chu, Yanfeng Liu, Yichi Zhang, Lukasz Lew, Andrew Howard

View PDF HTML (experimental)

Abstract:The discontinuous operations inherent in quantization and sparsification introduce obstacles to backpropagation. This is particularly challenging when training deep neural networks in ultra-low precision and sparse regimes. We propose a novel, robust, and universal solution: a denoising affine transform that stabilizes training under these challenging conditions. By formulating quantization and sparsification as perturbations during training, we derive a perturbation-resilient approach based on ridge regression. Our solution employs a piecewise constant backbone model to ensure a performance lower bound and features an inherent noise reduction mechanism to mitigate perturbation-induced corruption. This formulation allows existing models to be trained at arbitrarily low precision and sparsity levels with off-the-shelf recipes. Furthermore, our method provides a novel perspective on training temporal binary neural networks, contributing to ongoing efforts to narrow the gap between artificial and biological neural networks.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Numerical Analysis (math.NA)
Cite as:	arXiv:2409.09245 [cs.LG]
	(or arXiv:2409.09245v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2409.09245

Submission history

From: Chengxi Ye [view email]
[v1] Sat, 14 Sep 2024 00:57:32 UTC (158 KB)

Computer Science > Machine Learning

Title:Robust Training of Neural Networks at Arbitrary Precision and Sparsity

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Robust Training of Neural Networks at Arbitrary Precision and Sparsity

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators