Training Deep Networks without Learning Rates Through Coin Betting

Orabona, Francesco; Tommasi, Tatiana

Computer Science > Machine Learning

arXiv:1705.07795 (cs)

[Submitted on 22 May 2017 (v1), last revised 4 Nov 2017 (this version, v3)]

Title:Training Deep Networks without Learning Rates Through Coin Betting

Authors:Francesco Orabona, Tatiana Tommasi

View PDF

Abstract:Deep learning methods achieve state-of-the-art performance in many application scenarios. Yet, these methods require a significant amount of hyperparameters tuning in order to achieve the best results. In particular, tuning the learning rates in the stochastic optimization process is still one of the main bottlenecks. In this paper, we propose a new stochastic gradient descent procedure for deep networks that does not require any learning rate setting. Contrary to previous methods, we do not adapt the learning rates nor we make use of the assumed curvature of the objective function. Instead, we reduce the optimization process to a game of betting on a coin and propose a learning-rate-free optimal algorithm for this scenario. Theoretical convergence is proven for convex and quasi-convex functions and empirical evidence shows the advantage of our algorithm over popular stochastic gradient algorithms.

Comments:	Camera-ready version for NIPS 2017
Subjects:	Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
Cite as:	arXiv:1705.07795 [cs.LG]
	(or arXiv:1705.07795v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1705.07795

Submission history

From: Francesco Orabona [view email]
[v1] Mon, 22 May 2017 15:04:05 UTC (153 KB)
[v2] Tue, 30 May 2017 17:21:27 UTC (87 KB)
[v3] Sat, 4 Nov 2017 21:19:04 UTC (304 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2017-05

Change to browse by:

cs
math
math.OC
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Francesco Orabona
Tatiana Tommasi

export BibTeX citation

Computer Science > Machine Learning

Title:Training Deep Networks without Learning Rates Through Coin Betting

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Training Deep Networks without Learning Rates Through Coin Betting

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators