Deep Neural Networks with Trainable Activations and Controlled Lipschitz Constant

Aziznejad, Shayan; Gupta, Harshit; Campos, Joaquim; Unser, Michael

doi:10.1109/TSP.2020.3014611

arXiv:2001.06263 (cs)

[Submitted on 17 Jan 2020 (v1), last revised 7 Aug 2020 (this version, v2)]

Abstract: We introduce a variational framework to learn the activation functions of deep neural networks. Our aim is to increase the capacity of the network while controlling an upper bound on the actual Lipschitz constant of the input-output relation. To that end, we first establish a global bound for the Lipschitz constant of neural networks. Based on this bound, we then formulate a variational problem for learning activation functions. This variational problem is infinite-dimensional and not computationally tractable. However, we prove that there always exists a solution with continuous and piecewise-linear (linear-spline) activations. This reduces the original problem to a finite-dimensional minimization in which an ℓ1 penalty on the parameters of the activations favors the learning of sparse nonlinearities. We numerically compare our scheme with the standard ReLU network and its variations, PReLU and LeakyReLU, and we empirically demonstrate the practical aspects of our framework.
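
To make the ideas in the abstract concrete, here is a minimal sketch in Python. It is not the authors' implementation, and it rests on assumptions the abstract does not state. It assumes a linear-spline activation written as b0 + b1*x + sum_k a[k]*ReLU(x - knots[k]) with knots sorted in ascending order, which is one common parameterization. It takes the ℓ1 penalty over the coefficients a[k], so a zero coefficient removes a knot. The network bound shown is the generic product of spectral norms and activation Lipschitz constants, which may differ from the bound derived in the paper. All function names are hypothetical.

```python
import numpy as np

def spline_activation(x, b0, b1, a, knots):
    """Evaluate b0 + b1*x + sum_k a[k] * relu(x - knots[k]) elementwise."""
    x = np.asarray(x, dtype=float)
    # hinge[..., k] = relu(x - knots[k]); the last axis indexes the knots
    hinge = np.maximum(x[..., None] - knots, 0.0)
    return b0 + b1 * x + hinge @ a

def spline_lipschitz(b1, a):
    """Lipschitz constant of the spline: largest absolute slope of any piece.

    Assumes knots are sorted ascending, so the slope after knot k is
    b1 + a[0] + ... + a[k].
    """
    slopes = b1 + np.concatenate(([0.0], np.cumsum(a)))
    return np.max(np.abs(slopes))

def l1_penalty(a):
    """Sparsity-promoting penalty: sum of absolute slope changes at the knots."""
    return np.sum(np.abs(a))

def network_lipschitz_bound(weights, act_lipschitz):
    """Generic upper bound for a feedforward network (Euclidean norm).

    weights: list of weight matrices, one per linear layer.
    act_lipschitz: one value per activation layer, the largest Lipschitz
    constant among that layer's scalar activations.
    """
    bound = 1.0
    for W in weights:
        bound *= np.linalg.norm(W, 2)  # spectral norm (largest singular value)
    for L in act_lipschitz:
        bound *= L
    return bound
```

Under this parameterization, ReLU corresponds to b0 = 0, b1 = 0, a = [1], knots = [0], and a LeakyReLU with negative slope 0.1 corresponds to b1 = 0.1, a = [0.9]; both have a Lipschitz constant of 1.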

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2001.06263 [cs.LG]
	(or arXiv:2001.06263v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2001.06263
Related DOI:	https://doi.org/10.1109/TSP.2020.3014611

Submission history

From: Shayan Aziznejad [view email]
[v1] Fri, 17 Jan 2020 12:32:55 UTC (1,349 KB)
[v2] Fri, 7 Aug 2020 13:27:44 UTC (1,982 KB)
